Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0567 |
Symbol | |
ID | 4569162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 634030 |
End bp | 635856 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 639765165 |
Product | extracellular solute-binding protein |
Protein accession | YP_911047 |
Protein GI | 119356403 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATGT TTACTAAACG GGAAGTAACA GTGCCTGCAT GTTCTAATAG CCGGCCATTT CGTTTCTCGC TGAGCGCGTT AATTCTTTTT ATGACGGTGC TGGCTTCCAC TTCCTGCAGC GAAAAAAAAC AGGGCGATGA CATCGGAAGC AAAGGTCGTG CAGCAAAAGA TTCAACGCTG GTTATTGCAA TGCTGGGGGA TGCTGATTAT TTGAATCCCG TGCTTGGAAC AACGGTGACC TCGAACAATA TTTTCAGTCT CATCTATCCG GGTCTCTTGC AAAGCGAGTT TGATACGACC ACTGGTTTGC TGAATTTTAT CGCGCTTGAA AAACGGTTGA GGCAAACAGG CACCGGTACC GGGAAAAAAA CGCCACGCGC TGCTCTTGCA AAAACCTGGC GGATGGCTCC GGATCATAAA TCCATTACCT ATATTCTCAG AAACAACGCA TTCTGGAACG ATGGCAAGCC GATTGTTTCC GGAGATTTTA AGTTTTCCTA TAAGCTGTAT GGTAATCCCG TTATTGCAAG TGCTCGTCAG CAGTACCTTT CCGAGCTGAT CGGCGCTGAA ACCGGGCAGG TTGATTTTCG GAGGGCTATC GAAACACCTG ATGACACGAC ATTGATTTTC AGGTTTCATA AGCCTGTTTC TGAACAGCTT GCGCTTTTTC ATACCTCGCT GACTCCTTTG CCTTCACATT ACTGGAGGTC GGTAAAGCCG GAGGATTTCA GAAGCTCGCC ACTCAATCAG TTACCGCTTG GCGCAGGGCC ATACAAGTTG CAGGTTTGGC GGCAGCAGCA GGAAATTGTG CTTGCTTCAA ACAAGAGAAG TAATCTGCCT AAGCCAGGCA ATATCCCCTA TATTTCCTAT CGTGTTGTGC CGGATTATAC GGTAAGATTA ACTCAGCTTC AGACGAATGC TGTTGATGTT GTTGAAAATA TTAAACCTGA GGATTTTCAG GGGGTTCTGA AATCCAACGC TGCAATTGAG ATTAAAACTG TCGGACTCAG GGTTTTTGAC TATGTAGGCT GGTCAAATAT TGATCAGGCC GAGTATCACA AAACCGGAAA AATCAAACCC CATCCGCTTT TTGGTTCTGC ACAGGTTCGC CTTGCGCTTA CAACGGCTAT TGACAGAGAG TCGATCATTG ATGGTTATCT CAAGAGCTAT GGCGTTCTTT GCAATACCGA TATTTCACCT TCGCTGAAAT GGGCGTACAA TAGAGCTATT CTTGCTCATC AGTTCGATCC CGCAAAAGCT TCGGCACTGC TCAAAGCCGA AGGCTGGCTT CCGGGACCTG ACGGTATTCT TCGAAAAAAC GGAAGGAAAT TCAGTTTTGT ACTTTACACC AATTCCGGCA ATGCCCGGAG AAATTATGCG AGCGTCATCA TCCAGCAGAA TCTGAAGGCG ATCGGCATTG ACTGCAAGCT TGACGTTCAG GAATCCAATG TCTTTTTTGA AAATCTTCAG TCGAGAAAAC TTGATGCATG GATGGCCGGC TGGTCTATAG GGCTTGAAAT TGATCCTCTT GATGTCTGGG GTTCCGATCT CAAAAAAAGC CGATTTAATT TTACCGGCTA TCAAAACCCG AGAATTGACG GACTTTGTGA GCTTGCGAAA CAGAAGATGG ATCCACTGGA AGCGAAAGCG TACTGGATGG AATATCAGCA AATTCTTCAT CGCGATCAGC CGGTCACATT TTTGTACTGG ATAAGGGAAA CGCAAGGTTT CAATAAAAGA ATTCAGGGCG AAGAGCTTAA TATTTCAGGA ACCTTTTACA ATATTGACGA CTGGACTCTT AACCCTTCGG CAACTGTGGC TCTTTAA
|
Protein sequence | MTMFTKREVT VPACSNSRPF RFSLSALILF MTVLASTSCS EKKQGDDIGS KGRAAKDSTL VIAMLGDADY LNPVLGTTVT SNNIFSLIYP GLLQSEFDTT TGLLNFIALE KRLRQTGTGT GKKTPRAALA KTWRMAPDHK SITYILRNNA FWNDGKPIVS GDFKFSYKLY GNPVIASARQ QYLSELIGAE TGQVDFRRAI ETPDDTTLIF RFHKPVSEQL ALFHTSLTPL PSHYWRSVKP EDFRSSPLNQ LPLGAGPYKL QVWRQQQEIV LASNKRSNLP KPGNIPYISY RVVPDYTVRL TQLQTNAVDV VENIKPEDFQ GVLKSNAAIE IKTVGLRVFD YVGWSNIDQA EYHKTGKIKP HPLFGSAQVR LALTTAIDRE SIIDGYLKSY GVLCNTDISP SLKWAYNRAI LAHQFDPAKA SALLKAEGWL PGPDGILRKN GRKFSFVLYT NSGNARRNYA SVIIQQNLKA IGIDCKLDVQ ESNVFFENLQ SRKLDAWMAG WSIGLEIDPL DVWGSDLKKS RFNFTGYQNP RIDGLCELAK QKMDPLEAKA YWMEYQQILH RDQPVTFLYW IRETQGFNKR IQGEELNISG TFYNIDDWTL NPSATVAL
|
| |