Gene Cpha266_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1789 
Symbol 
ID4571151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2040468 
End bp2042186 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content48% 
IMG OID639766372 
Productextracellular solute-binding protein 
Protein accessionYP_912230 
Protein GI119357586 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCAAC CTGATTTCCG TACAACACTG CGAGTCGCTA CAACGTTGTT GTACTCATCG 
CTCATTGTGT TCATGCTCGC CTCTCTTTCT GCCTGTCGCC AAAACAGTGA AAACTCCCGC
AAAGATCATA TCGTGGCCGG AGTATCGGCC GACTTCGATT ACCTTAACCC CCTGCTCATC
CAGCTCTCCC TCTCAAGGGA GGTGTGTTCG CTGATCTACC CCGCACTGGT CAAACCTGAA
TACAATGAAA AAAAAGGAAC CATTGACTAT CAGCCATCTG CGGCTGAACG ATGGGAATTC
TCTTCCGACG GTAAAACCGT GATCTTTCAC CTCCGCCCTG ATGCCATTTG GGAAGACGGA
AAAAAAGTAA CATCACGCGA TTTCAAGTTT TCTTATTCAC TGTATAAAAA CCCTGCAATT
GCCAGCTCCC GCCAGCACTA TCTTGATGAT CTGCTTTTAC TACCCGACGG ATCTCCTGAT
ATCGACAAAA GCGTAGAAAC GCCTGATGAG ACAACTCTGA TTCTTCATTT CAGAAAACCG
ATGGATCCCG ACATCGTGCT TGACCACTTC AACGATCTCA TGCCCGTCGC CGAACATCTT
TTCAGAACAG TTCCTCCTGG TGAAATCCGA AGCAGAGCTG CAGAACTTCC GATAACAGGA
GCCGGACCGT TCAAGGTTAA AGAGTGGTCT CGTCAGCAGA AACTCGTACT GATTTCCAGT
CCCACTGCTG TTCTTCCCCA TCCTGCAGTA TCCAAAACCA TGACATTCCT TATTGTTCCT
GAATATACCA CCCGCCTGGC CATGCTCAAA TCAGGACAGA TAGACGCCAT GATCTCTTCC
GGCGGCATCA ATCCGAGGGA TGTCCCTGAA CTTCTGAAAA CAAACCCTGA TATCACCATC
AAACCACTGA AAAACCGCTA TTTCGACAGT GTAGTCTGGC TTGCTATCGA TGGCGAGACA
TACAGAAATA CAGGAAGCAT TGTGCCGAAC CGGTTTTTCG GAAACAGGAA AGTTCGCCAG
GCAATGACCT TTGCCATTGA CCGCCAGGCC ATTATTGATG GATTCATGGG ACCTGAACAT
GCCGCTATTG TCAACACATC ACTATCACCG GCATACACAA CAATCAGGGA TACCACAAGC
GAATCGTATG CCTTCAATCC CGAAAAAGCA AAATCGCTGC TGCGCGAGGA GGGGTGGATC
CAGGGACCTG ACGGTATTCT GCAAAAAAAC GGGGTACGAT TCTCTTTTGA ACTTGCCGCA
CAGGTCGGCA ATCCACGTCG AAACTATGCG GCAACCATCA TCCAGCAGAA CTTGCGGGAT
GTCGGTATCG ACTGCCGTCT GAAGTTTGAT GAAAGTCTTG TTTTTCTGAA AAGCCAGAAT
GAATTTCGCT ATGATGCCGC ACTTTCAGGA CTTGCGGCTG AAACTCTGCC GTTCCAGCTT
GTCATCTGGG GTTCGGATTT CTCCTCAAGA CCTTTCAACT CTTCGGCTTT TCAGAACAGG
GAGCTTGATG CCGTTATTGC CGAACTGAGC CAACCACTGG ATCTTCAGAA GAAACAACGG
CTATGGATAA CCTATCAGCA AATTCTGACC GAAGAGCAAC CCCGATCATT TCTCTACTAT
TACGACGAGC TTGAAGGCTT CAGTAAACGC CTTGAAAACG TCAAGGTGAA CCTGATCGCA
ACGCTCTACA ACGCTTACGA ATGGAGTCTG AAAAACTGA
 
Protein sequence
MLQPDFRTTL RVATTLLYSS LIVFMLASLS ACRQNSENSR KDHIVAGVSA DFDYLNPLLI 
QLSLSREVCS LIYPALVKPE YNEKKGTIDY QPSAAERWEF SSDGKTVIFH LRPDAIWEDG
KKVTSRDFKF SYSLYKNPAI ASSRQHYLDD LLLLPDGSPD IDKSVETPDE TTLILHFRKP
MDPDIVLDHF NDLMPVAEHL FRTVPPGEIR SRAAELPITG AGPFKVKEWS RQQKLVLISS
PTAVLPHPAV SKTMTFLIVP EYTTRLAMLK SGQIDAMISS GGINPRDVPE LLKTNPDITI
KPLKNRYFDS VVWLAIDGET YRNTGSIVPN RFFGNRKVRQ AMTFAIDRQA IIDGFMGPEH
AAIVNTSLSP AYTTIRDTTS ESYAFNPEKA KSLLREEGWI QGPDGILQKN GVRFSFELAA
QVGNPRRNYA ATIIQQNLRD VGIDCRLKFD ESLVFLKSQN EFRYDAALSG LAAETLPFQL
VIWGSDFSSR PFNSSAFQNR ELDAVIAELS QPLDLQKKQR LWITYQQILT EEQPRSFLYY
YDELEGFSKR LENVKVNLIA TLYNAYEWSL KN