Gene RoseRS_2165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2165 
Symbol 
ID5209127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2665494 
End bp2666699 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content62% 
IMG OID640595766 
ProductVWA containing CoxE family protein 
Protein accessionYP_001276495 
Protein GI148656290 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAGT TAGAACCACT GATTCCCCAC GGGAACCTGA TGACGCATGT GGTTGCGTTC 
GTGCACCTGC TGCGCAGCAC CGGCATCAAA GTCAGCAGCG AACAGACCAT CGATCTGGCG
CGGGCGCTGG AGCATGTGCC GATTGTGGCG CGTGGGGATT TTCGCGCAGC TGCGCGCTGT
ACCCTGATCT GCCGACGCGA AGATCTGCCG ATGTTCGATG CTGCGTTCGA TTTCTACTGG
CGCACGCAGT CAGGGTTTGA TCCGTTGATG CTGGCGATCC CGGTGGTCAA GATGCCGCCG
AAACCGCTGC GTCTGCCGCG CCGACCGCGC AGCCAGGGCG ATGGACACAA TGAGCCGGAT
CGTCATGAAG AACAGCAGGA AGAGAAGGTT GGCTTTACGC TCACCTTTAC GGCTGCTGAG
ACGTTGCGCA CCAAAGACTT CGGCAACTTC AGTTACGAAG AGGTGCAGGC GTGCAAGGAG
TTGCTACGCA CACTCGAGTG GCGCATCGAG CCGCGTCGCA CCCGTCGTCG TCGCCCGGCA
GTGCGCGCCG GCGAGATCGA TATGCGCCGC ATCCTGCGCC GCAACCTGCG CCACGGCGGC
GACCCGATTG AGTTGACCTT CCGCGAGCCG CGCTATCGGC AGCGTCCGCT CGTCGTGCTG
TGCGACATCA GCGGTTCGAT GGATCGCTAC AGTCGTATCC TGCTTCAATT CGTGCATACT
ATCTCGAACG GCTTGCGTGA CGTGGAAGCG TTCGTATTCG GCACGCGCCT GACGCGCATT
ACCCGTCTGT TGCGTGAACG CGATATCGAT GAAGCCATCG CAGCCGTCAG CAAACATGTG
GTGGACTGGT CGGGCGGGAC GCGGATTGGC GAGGCGGTCA GGCACTTCAA TTACTACTGG
TCGCGCCGGG TGCTGGGGCG CGGTCCGGTG GTGTTGCTCA TCAGCGACGG ATGGGATCGC
GGCGATCCGC AGTTGCTGGG GCGTGAAATG GCGCGGCTGC AACGTTCATG CTACCGCCTG
ATCTGGTTGA ACCCGTTGCT GGGGAACCCG CGCTATCAAC CGCTCACCCA GGGGATGCAG
GCGGCGCTGC CGTTTGTCGA TGACTTTTTG CCGGTGCACA ACCTGGTAAG CCTGGAGCAA
CTCGGCGCAA AACTGGCGAT GCTTGGCGCG CGCCGCCCTG AGCGACGCCA GCGGATTGGA
ACCTAG
 
Protein sequence
MDELEPLIPH GNLMTHVVAF VHLLRSTGIK VSSEQTIDLA RALEHVPIVA RGDFRAAARC 
TLICRREDLP MFDAAFDFYW RTQSGFDPLM LAIPVVKMPP KPLRLPRRPR SQGDGHNEPD
RHEEQQEEKV GFTLTFTAAE TLRTKDFGNF SYEEVQACKE LLRTLEWRIE PRRTRRRRPA
VRAGEIDMRR ILRRNLRHGG DPIELTFREP RYRQRPLVVL CDISGSMDRY SRILLQFVHT
ISNGLRDVEA FVFGTRLTRI TRLLRERDID EAIAAVSKHV VDWSGGTRIG EAVRHFNYYW
SRRVLGRGPV VLLISDGWDR GDPQLLGREM ARLQRSCYRL IWLNPLLGNP RYQPLTQGMQ
AALPFVDDFL PVHNLVSLEQ LGAKLAMLGA RRPERRQRIG T