Gene RoseRS_2768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2768 
Symbol 
ID5209737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3449464 
End bp3450702 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content62% 
IMG OID640596368 
Productvon Willebrand factor, type A 
Protein accessionYP_001277090 
Protein GI148656885 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.271285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACCG GCATAACGCT TACGTGCACC TGGGGACGCG CACCGCTGGT TGCCAGCAAG 
GTTCCACAGG TCGCCTATCT GCTTATCGAA GCGCAGGCTG CGGCAATTGC CGAGAAAATG
CCGCTCAATT TCTGCCTGGT GCTCGACCGC TCCGGGTCGA TGCAGGGCGC CAAACTTGCG
GCGCTGAAGG ATGCCGTCAA ACGGGTGATC GAAACCCTGA CCCCGCAGGA CATTGTTGCG
ATTGTGCTGT TCGACGACAC CGTGCAAACG CTGGTTCCTG CGACGTTTGC CACCGACAAA
GCGACGCTGA TTGCACAGGT CGATGCCATC GAAGAAGCTG GCGGCACGGC GATGTCGGGC
GGGATGGCGG CGGGTATTGT GGAACTGCGC AAGAACCATG ACCCCGGACG TGTCGGGGCG
ATGCTGCTGC TGACCGACGG GCAGACATGG GGTGATGAGG ATCGTTGCCG CGCACTCGCC
CAGGAACTCG CCCGTGATGG GGTGCGGATC ACGGCCCTGG GTCTCGGCGC TGAATGGAAC
GAAGCGTTGC TCGACGATAT TGCCGAAGCG ACCGGCGGCA TTTCAGACTA CATCGCCGAT
CCGGCGCAGA TCACGACGTT CTTCCAGCAT GCAGTCCGCA CAGCACAGGG AACGGTTGCC
CGCGACGCAC GTCTGTTGTT GCGTCTGGTG CGTGACGCAA CGCCGCGCGC GGTTTATCGC
GCCAGTCCTG TGATCGCCAA TCTTGGCTAC CAGCCAATCG GCGACAGTGA GATTGCCGTG
CGGCTCGGCG CCATCGAGGC GGAGACACCG TCGAGCATCG TTGTCGATCT GATGGTTCCG
GCGCGCGACG CGGGGAGTTT TCGCATCGCC CAGGCGGAAC TTCACTACAC GCCGGTTGGC
GGCGCAGAAC AGGTCGTCAA ACAGGACCTG CTGCTCGAGT TCGTGACCGA CCCGACTGCG
TCGGCGTATG ATCCGCGTGT GATGAATCTG GTCGAAAAGG TGACGGCGTT CAAATTGCAA
ACACGCGCGC TGGCTGAAGC AGAAGCCGGC AACCTGTCGG GCGCGACGCA AAAACTGCGC
GCGGCTGCGA CACGGTTGCT CGATCTTGGA GAACTTGAAC TGGCGCAGAA AGTCTCCGAG
CAGGCGGCGC ATCTCGAACA GGGGCAGGCG ATCAGCGCCG AAAATCAGAA AGAACTGCGC
TATGCAACCC GTCGCCTGAC GCAAAAACTC GAGGAATAG
 
Protein sequence
MNTGITLTCT WGRAPLVASK VPQVAYLLIE AQAAAIAEKM PLNFCLVLDR SGSMQGAKLA 
ALKDAVKRVI ETLTPQDIVA IVLFDDTVQT LVPATFATDK ATLIAQVDAI EEAGGTAMSG
GMAAGIVELR KNHDPGRVGA MLLLTDGQTW GDEDRCRALA QELARDGVRI TALGLGAEWN
EALLDDIAEA TGGISDYIAD PAQITTFFQH AVRTAQGTVA RDARLLLRLV RDATPRAVYR
ASPVIANLGY QPIGDSEIAV RLGAIEAETP SSIVVDLMVP ARDAGSFRIA QAELHYTPVG
GAEQVVKQDL LLEFVTDPTA SAYDPRVMNL VEKVTAFKLQ TRALAEAEAG NLSGATQKLR
AAATRLLDLG ELELAQKVSE QAAHLEQGQA ISAENQKELR YATRRLTQKL EE