Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1789 |
Symbol | |
ID | 4571151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2040468 |
End bp | 2042186 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639766372 |
Product | extracellular solute-binding protein |
Protein accession | YP_912230 |
Protein GI | 119357586 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCAAC CTGATTTCCG TACAACACTG CGAGTCGCTA CAACGTTGTT GTACTCATCG CTCATTGTGT TCATGCTCGC CTCTCTTTCT GCCTGTCGCC AAAACAGTGA AAACTCCCGC AAAGATCATA TCGTGGCCGG AGTATCGGCC GACTTCGATT ACCTTAACCC CCTGCTCATC CAGCTCTCCC TCTCAAGGGA GGTGTGTTCG CTGATCTACC CCGCACTGGT CAAACCTGAA TACAATGAAA AAAAAGGAAC CATTGACTAT CAGCCATCTG CGGCTGAACG ATGGGAATTC TCTTCCGACG GTAAAACCGT GATCTTTCAC CTCCGCCCTG ATGCCATTTG GGAAGACGGA AAAAAAGTAA CATCACGCGA TTTCAAGTTT TCTTATTCAC TGTATAAAAA CCCTGCAATT GCCAGCTCCC GCCAGCACTA TCTTGATGAT CTGCTTTTAC TACCCGACGG ATCTCCTGAT ATCGACAAAA GCGTAGAAAC GCCTGATGAG ACAACTCTGA TTCTTCATTT CAGAAAACCG ATGGATCCCG ACATCGTGCT TGACCACTTC AACGATCTCA TGCCCGTCGC CGAACATCTT TTCAGAACAG TTCCTCCTGG TGAAATCCGA AGCAGAGCTG CAGAACTTCC GATAACAGGA GCCGGACCGT TCAAGGTTAA AGAGTGGTCT CGTCAGCAGA AACTCGTACT GATTTCCAGT CCCACTGCTG TTCTTCCCCA TCCTGCAGTA TCCAAAACCA TGACATTCCT TATTGTTCCT GAATATACCA CCCGCCTGGC CATGCTCAAA TCAGGACAGA TAGACGCCAT GATCTCTTCC GGCGGCATCA ATCCGAGGGA TGTCCCTGAA CTTCTGAAAA CAAACCCTGA TATCACCATC AAACCACTGA AAAACCGCTA TTTCGACAGT GTAGTCTGGC TTGCTATCGA TGGCGAGACA TACAGAAATA CAGGAAGCAT TGTGCCGAAC CGGTTTTTCG GAAACAGGAA AGTTCGCCAG GCAATGACCT TTGCCATTGA CCGCCAGGCC ATTATTGATG GATTCATGGG ACCTGAACAT GCCGCTATTG TCAACACATC ACTATCACCG GCATACACAA CAATCAGGGA TACCACAAGC GAATCGTATG CCTTCAATCC CGAAAAAGCA AAATCGCTGC TGCGCGAGGA GGGGTGGATC CAGGGACCTG ACGGTATTCT GCAAAAAAAC GGGGTACGAT TCTCTTTTGA ACTTGCCGCA CAGGTCGGCA ATCCACGTCG AAACTATGCG GCAACCATCA TCCAGCAGAA CTTGCGGGAT GTCGGTATCG ACTGCCGTCT GAAGTTTGAT GAAAGTCTTG TTTTTCTGAA AAGCCAGAAT GAATTTCGCT ATGATGCCGC ACTTTCAGGA CTTGCGGCTG AAACTCTGCC GTTCCAGCTT GTCATCTGGG GTTCGGATTT CTCCTCAAGA CCTTTCAACT CTTCGGCTTT TCAGAACAGG GAGCTTGATG CCGTTATTGC CGAACTGAGC CAACCACTGG ATCTTCAGAA GAAACAACGG CTATGGATAA CCTATCAGCA AATTCTGACC GAAGAGCAAC CCCGATCATT TCTCTACTAT TACGACGAGC TTGAAGGCTT CAGTAAACGC CTTGAAAACG TCAAGGTGAA CCTGATCGCA ACGCTCTACA ACGCTTACGA ATGGAGTCTG AAAAACTGA
|
Protein sequence | MLQPDFRTTL RVATTLLYSS LIVFMLASLS ACRQNSENSR KDHIVAGVSA DFDYLNPLLI QLSLSREVCS LIYPALVKPE YNEKKGTIDY QPSAAERWEF SSDGKTVIFH LRPDAIWEDG KKVTSRDFKF SYSLYKNPAI ASSRQHYLDD LLLLPDGSPD IDKSVETPDE TTLILHFRKP MDPDIVLDHF NDLMPVAEHL FRTVPPGEIR SRAAELPITG AGPFKVKEWS RQQKLVLISS PTAVLPHPAV SKTMTFLIVP EYTTRLAMLK SGQIDAMISS GGINPRDVPE LLKTNPDITI KPLKNRYFDS VVWLAIDGET YRNTGSIVPN RFFGNRKVRQ AMTFAIDRQA IIDGFMGPEH AAIVNTSLSP AYTTIRDTTS ESYAFNPEKA KSLLREEGWI QGPDGILQKN GVRFSFELAA QVGNPRRNYA ATIIQQNLRD VGIDCRLKFD ESLVFLKSQN EFRYDAALSG LAAETLPFQL VIWGSDFSSR PFNSSAFQNR ELDAVIAELS QPLDLQKKQR LWITYQQILT EEQPRSFLYY YDELEGFSKR LENVKVNLIA TLYNAYEWSL KN
|
| |