Gene Cpha266_0567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0567 
Symbol 
ID4569162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp634030 
End bp635856 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content46% 
IMG OID639765165 
Productextracellular solute-binding protein 
Protein accessionYP_911047 
Protein GI119356403 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATGT TTACTAAACG GGAAGTAACA GTGCCTGCAT GTTCTAATAG CCGGCCATTT 
CGTTTCTCGC TGAGCGCGTT AATTCTTTTT ATGACGGTGC TGGCTTCCAC TTCCTGCAGC
GAAAAAAAAC AGGGCGATGA CATCGGAAGC AAAGGTCGTG CAGCAAAAGA TTCAACGCTG
GTTATTGCAA TGCTGGGGGA TGCTGATTAT TTGAATCCCG TGCTTGGAAC AACGGTGACC
TCGAACAATA TTTTCAGTCT CATCTATCCG GGTCTCTTGC AAAGCGAGTT TGATACGACC
ACTGGTTTGC TGAATTTTAT CGCGCTTGAA AAACGGTTGA GGCAAACAGG CACCGGTACC
GGGAAAAAAA CGCCACGCGC TGCTCTTGCA AAAACCTGGC GGATGGCTCC GGATCATAAA
TCCATTACCT ATATTCTCAG AAACAACGCA TTCTGGAACG ATGGCAAGCC GATTGTTTCC
GGAGATTTTA AGTTTTCCTA TAAGCTGTAT GGTAATCCCG TTATTGCAAG TGCTCGTCAG
CAGTACCTTT CCGAGCTGAT CGGCGCTGAA ACCGGGCAGG TTGATTTTCG GAGGGCTATC
GAAACACCTG ATGACACGAC ATTGATTTTC AGGTTTCATA AGCCTGTTTC TGAACAGCTT
GCGCTTTTTC ATACCTCGCT GACTCCTTTG CCTTCACATT ACTGGAGGTC GGTAAAGCCG
GAGGATTTCA GAAGCTCGCC ACTCAATCAG TTACCGCTTG GCGCAGGGCC ATACAAGTTG
CAGGTTTGGC GGCAGCAGCA GGAAATTGTG CTTGCTTCAA ACAAGAGAAG TAATCTGCCT
AAGCCAGGCA ATATCCCCTA TATTTCCTAT CGTGTTGTGC CGGATTATAC GGTAAGATTA
ACTCAGCTTC AGACGAATGC TGTTGATGTT GTTGAAAATA TTAAACCTGA GGATTTTCAG
GGGGTTCTGA AATCCAACGC TGCAATTGAG ATTAAAACTG TCGGACTCAG GGTTTTTGAC
TATGTAGGCT GGTCAAATAT TGATCAGGCC GAGTATCACA AAACCGGAAA AATCAAACCC
CATCCGCTTT TTGGTTCTGC ACAGGTTCGC CTTGCGCTTA CAACGGCTAT TGACAGAGAG
TCGATCATTG ATGGTTATCT CAAGAGCTAT GGCGTTCTTT GCAATACCGA TATTTCACCT
TCGCTGAAAT GGGCGTACAA TAGAGCTATT CTTGCTCATC AGTTCGATCC CGCAAAAGCT
TCGGCACTGC TCAAAGCCGA AGGCTGGCTT CCGGGACCTG ACGGTATTCT TCGAAAAAAC
GGAAGGAAAT TCAGTTTTGT ACTTTACACC AATTCCGGCA ATGCCCGGAG AAATTATGCG
AGCGTCATCA TCCAGCAGAA TCTGAAGGCG ATCGGCATTG ACTGCAAGCT TGACGTTCAG
GAATCCAATG TCTTTTTTGA AAATCTTCAG TCGAGAAAAC TTGATGCATG GATGGCCGGC
TGGTCTATAG GGCTTGAAAT TGATCCTCTT GATGTCTGGG GTTCCGATCT CAAAAAAAGC
CGATTTAATT TTACCGGCTA TCAAAACCCG AGAATTGACG GACTTTGTGA GCTTGCGAAA
CAGAAGATGG ATCCACTGGA AGCGAAAGCG TACTGGATGG AATATCAGCA AATTCTTCAT
CGCGATCAGC CGGTCACATT TTTGTACTGG ATAAGGGAAA CGCAAGGTTT CAATAAAAGA
ATTCAGGGCG AAGAGCTTAA TATTTCAGGA ACCTTTTACA ATATTGACGA CTGGACTCTT
AACCCTTCGG CAACTGTGGC TCTTTAA
 
Protein sequence
MTMFTKREVT VPACSNSRPF RFSLSALILF MTVLASTSCS EKKQGDDIGS KGRAAKDSTL 
VIAMLGDADY LNPVLGTTVT SNNIFSLIYP GLLQSEFDTT TGLLNFIALE KRLRQTGTGT
GKKTPRAALA KTWRMAPDHK SITYILRNNA FWNDGKPIVS GDFKFSYKLY GNPVIASARQ
QYLSELIGAE TGQVDFRRAI ETPDDTTLIF RFHKPVSEQL ALFHTSLTPL PSHYWRSVKP
EDFRSSPLNQ LPLGAGPYKL QVWRQQQEIV LASNKRSNLP KPGNIPYISY RVVPDYTVRL
TQLQTNAVDV VENIKPEDFQ GVLKSNAAIE IKTVGLRVFD YVGWSNIDQA EYHKTGKIKP
HPLFGSAQVR LALTTAIDRE SIIDGYLKSY GVLCNTDISP SLKWAYNRAI LAHQFDPAKA
SALLKAEGWL PGPDGILRKN GRKFSFVLYT NSGNARRNYA SVIIQQNLKA IGIDCKLDVQ
ESNVFFENLQ SRKLDAWMAG WSIGLEIDPL DVWGSDLKKS RFNFTGYQNP RIDGLCELAK
QKMDPLEAKA YWMEYQQILH RDQPVTFLYW IRETQGFNKR IQGEELNISG TFYNIDDWTL
NPSATVAL