Gene Information |
Locus tag | Rcas_2529 |
Symbol | |
ID | 5540011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 3262077 |
End bp | 3263027 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640894660 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001432627 |
Protein GI | 156742498 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00436468 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTGCGAT CTCTTCGGTT TTTTGTCGCG TTCACCACGC TGGTCGGTCT GGTTCTTGCT GCATGCAGTG CACCCGCTCA AACCACACAA CCCACGGCTG TACCGCAACC CGCTCCCGCA GATGCCACTC CGGTCCGCAT TGGCTCGAAG AACTTCACCG AAGCGATTCT GGTTGCCGAA ATGTATGCGC TGGCGCTGGA AGATGCCGGC ATTCGCGTCG AGCGCAAGTT CAACCTCGGT GCAACGCCAG TGGCGCACAC GGCGCTGGTG AATGGCGAAA TCGATCTGTA CCCGGAGTAC ACGTCGACCG GTCTGCTCGA AGTGCTCAAG CAAGCGCCGA TTGCCGACGC CAGAGGCATT CTGGAGGCGG TGCGCAAGGG GTACGAAGAG CAATTCCAGG TGACCTGGCT CGAACCATCG CCATTCAACA ACACGAATGC GCTGGCAATG ACCCGGCAGC GCGCTGAAGA ACTGGGGATC AGAACCTACT CCGATCTGGT AGCGCATTCT GGCGATCTGA AACTTGGCGG TCCGCCGGAG TTTCCCGAGC GTGAGGACAC CAAAGGTTTG ATGGCTGCCT ATGGGTTCGA TCCGAAGTTT ATCGAAGAGA ACTTCGTGCA ACTCGACACC GGCGCATTGC GCTACGAGGC GCTTACCAAA GGTGACATCG ATGTGGTCGT CGCATTCGGC ACCGACGGGC AGATTAATGG GTTGGGTCTG GCGCTGCTGG AGGACGATAA GAACTACTAC CCCATCTATC AGATTGCGCC GGTCATTCGC CAGGATGCCC TGGCAGCCAA CCCACAGATT GCCGAGACGC TCAACCGGTT GGCGCCGCTC CTGACGAATG ATGTCATGTC CGGTTTGAAC TGGCAGGTCG ATGGACCGGA GAAGAAGGAG ATCGCCGACG TGGCGCGCAC CTTCCTGCAA CAACAGGGAT TTATCAAGTA G
|
Protein sequence | MVRSLRFFVA FTTLVGLVLA ACSAPAQTTQ PTAVPQPAPA DATPVRIGSK NFTEAILVAE MYALALEDAG IRVERKFNLG ATPVAHTALV NGEIDLYPEY TSTGLLEVLK QAPIADARGI LEAVRKGYEE QFQVTWLEPS PFNNTNALAM TRQRAEELGI RTYSDLVAHS GDLKLGGPPE FPEREDTKGL MAAYGFDPKF IEENFVQLDT GALRYEALTK GDIDVVVAFG TDGQINGLGL ALLEDDKNYY PIYQIAPVIR QDALAANPQI AETLNRLAPL LTNDVMSGLN WQVDGPEKKE IADVARTFLQ QQGFIK
|
| |
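The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the database record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    length = gene_length(3262077, 3263027)
    print(length, protein_length(length))  # prints "951 316"
```

Under these assumptions the results agree with the Gene Length (951 bp) and Protein Length (316 aa) fields. Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content.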