Gene Rcas_2529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2529 
Symbol 
ID5540011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3262077 
End bp3263027 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content58% 
IMG OID640894660 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001432627 
Protein GI156742498 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00436468 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGCGAT CTCTTCGGTT TTTTGTCGCG TTCACCACGC TGGTCGGTCT GGTTCTTGCT 
GCATGCAGTG CACCCGCTCA AACCACACAA CCCACGGCTG TACCGCAACC CGCTCCCGCA
GATGCCACTC CGGTCCGCAT TGGCTCGAAG AACTTCACCG AAGCGATTCT GGTTGCCGAA
ATGTATGCGC TGGCGCTGGA AGATGCCGGC ATTCGCGTCG AGCGCAAGTT CAACCTCGGT
GCAACGCCAG TGGCGCACAC GGCGCTGGTG AATGGCGAAA TCGATCTGTA CCCGGAGTAC
ACGTCGACCG GTCTGCTCGA AGTGCTCAAG CAAGCGCCGA TTGCCGACGC CAGAGGCATT
CTGGAGGCGG TGCGCAAGGG GTACGAAGAG CAATTCCAGG TGACCTGGCT CGAACCATCG
CCATTCAACA ACACGAATGC GCTGGCAATG ACCCGGCAGC GCGCTGAAGA ACTGGGGATC
AGAACCTACT CCGATCTGGT AGCGCATTCT GGCGATCTGA AACTTGGCGG TCCGCCGGAG
TTTCCCGAGC GTGAGGACAC CAAAGGTTTG ATGGCTGCCT ATGGGTTCGA TCCGAAGTTT
ATCGAAGAGA ACTTCGTGCA ACTCGACACC GGCGCATTGC GCTACGAGGC GCTTACCAAA
GGTGACATCG ATGTGGTCGT CGCATTCGGC ACCGACGGGC AGATTAATGG GTTGGGTCTG
GCGCTGCTGG AGGACGATAA GAACTACTAC CCCATCTATC AGATTGCGCC GGTCATTCGC
CAGGATGCCC TGGCAGCCAA CCCACAGATT GCCGAGACGC TCAACCGGTT GGCGCCGCTC
CTGACGAATG ATGTCATGTC CGGTTTGAAC TGGCAGGTCG ATGGACCGGA GAAGAAGGAG
ATCGCCGACG TGGCGCGCAC CTTCCTGCAA CAACAGGGAT TTATCAAGTA G
 
Protein sequence
MVRSLRFFVA FTTLVGLVLA ACSAPAQTTQ PTAVPQPAPA DATPVRIGSK NFTEAILVAE 
MYALALEDAG IRVERKFNLG ATPVAHTALV NGEIDLYPEY TSTGLLEVLK QAPIADARGI
LEAVRKGYEE QFQVTWLEPS PFNNTNALAM TRQRAEELGI RTYSDLVAHS GDLKLGGPPE
FPEREDTKGL MAAYGFDPKF IEENFVQLDT GALRYEALTK GDIDVVVAFG TDGQINGLGL
ALLEDDKNYY PIYQIAPVIR QDALAANPQI AETLNRLAPL LTNDVMSGLN WQVDGPEKKE
IADVARTFLQ QQGFIK