Gene PCC8801_2270 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2270 
Symbol 
ID7102502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2340816 
End bp2342558 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content39% 
IMG OID643475316 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002372445 
Protein GI218247074 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAGT CAGTCAACAT TCGCTCTTGT ATTCTACCTG TTTTACTGTC TGTCGGGTCA 
TCTTTTCTAC TCTGTGCTTG TAACCCTCAA CCAGAAACCC CAACCTCTCA GACAACTTCC
CAAAGCGAAA ATGAAACTTT AAAATTATTA TATTGGCAAG CTCCCACCAT TTTGAATCCT
CATCTTTCCA CTGGGTTTAA AGATAGTGAG GCAAGTCGAA TTACCTTAGA ACCCTTAGCC
AGTTATGATA ACAAAGGCCA GTTAATTCTG TTTTTAGCGG CGGAAATTCC CTCAGTAGAG
AATGGAGGAA TAGCCAAGGA TGGTAAATCA GTTATTTGGA AGTTAAAACA GGGGATAAAA
TGGTCAGATG GAACACCATT TACAGCAGCC GATGTTGTGT TTACCCATCA GTTTATTGCT
AATCCTAAAG TTGGAGCTAC CAGTGGGAAT AGTTATTTGA ATGTCGAAAA AGTAGAAGCT
TTAGATGATT ATACTGTAAA AGTGATTTTT AAACAACCGA CTCCGTCTTG GGATATCCCT
TTTGTCGGGG GTGCAGGGAT GATTTTACCC CGTCATCTCT ATGAAAAGTA TAACGGAGAA
AATGCACGAC AAGCTCCTAA TAATTTAATC GCTGTGGGAA CAGGACCTTA TAAAGTTGTG
GACTTTAAAC CAGGAGATGT GGTGGTTTTT GAGGCTAATT CCTATTTCCG AGAAGCGGAT
AAATTAGGGT TTAAACGCAT TGAATTGAAA GGGGGAGGAG ATGCTACCTC TGCTGCGAGA
GCAGTGTTAG AAACGGGGGA TGCAGACTAT GCTTATAATT TGCAGGTAGA AGTCCCTGTA
TTAAAGCAAT TAGAAGCAGC AGGAAAGGGC AAATTAAACT CAGTTTTTGG AGGCAATAGT
GAAAGAATTT TAATCAATTT AAGTGATCCT AATAAAGCAA CAACCGAGGG AGAAAGATCT
AGTCTACAAT TCCCCCATCC CTTGTTTAAA GATCCCAAAG TCAGAGAAGC TTTTACTTTA
GCGGTTGATC GAGATACTGT GGCTCAACAA TTGTATGGAA TTACGGGAAA AGCAACGCCT
AATGTCTTAG TTTCTCCTCC TGAATATAAC TCTCCTAATA CGAAATATGA ATTTAATTTA
GAGAAGGCAG CTAAGTTATT AGATGAGGCT GGCTGGAAGG ATTCTAATAA TAATGGTATT
CGAGACAAGG ATGGGGTAGA AATGCAAATT CTTTTCCAAA CATCCGTGAA TCCTTTGCGT
CAGAAAACCC AAGAAATCAT CAAACAAAGC TTACAACAAA TTGGAGTTGG TGTTGAATTA
AAAAGTATTG ATGCAAGTAT TTTCTTTTCG AGTGATCCGT CTAATAATGA TACCGTTGAA
CGTTTCTATG CCGATTTTCA AATGTTCACC TCTGGTAATT TAAACCCCGA TCCCAGTACC
TACATGAGTA ATTTTACCTG TCAATCCATT CCCCAAAAAG CGAATAATTG GTCAGGGAAT
AATTACGCTC GCTATTGTAA CCCCGAATAT GATAAATTAT GGAAAGAAGC CACCCAAGAA
TTAGATGCCA AAAAACGCCA AGAACTCTTT ATTAAAATGA ATGATCTGTT AGTCAACAAT
TTTGTTCTTA TTCCCTTAGT CCATCGTGCT GATGTAGCAG GAATTAGTAA TCGTTTACAA
GGGTTCGAGT TGACTCCGTG GGACTTTAAT ACTTGGAAGA TTAAAGATTG GAAAAAATCT
TAA
 
Protein sequence
MGKSVNIRSC ILPVLLSVGS SFLLCACNPQ PETPTSQTTS QSENETLKLL YWQAPTILNP 
HLSTGFKDSE ASRITLEPLA SYDNKGQLIL FLAAEIPSVE NGGIAKDGKS VIWKLKQGIK
WSDGTPFTAA DVVFTHQFIA NPKVGATSGN SYLNVEKVEA LDDYTVKVIF KQPTPSWDIP
FVGGAGMILP RHLYEKYNGE NARQAPNNLI AVGTGPYKVV DFKPGDVVVF EANSYFREAD
KLGFKRIELK GGGDATSAAR AVLETGDADY AYNLQVEVPV LKQLEAAGKG KLNSVFGGNS
ERILINLSDP NKATTEGERS SLQFPHPLFK DPKVREAFTL AVDRDTVAQQ LYGITGKATP
NVLVSPPEYN SPNTKYEFNL EKAAKLLDEA GWKDSNNNGI RDKDGVEMQI LFQTSVNPLR
QKTQEIIKQS LQQIGVGVEL KSIDASIFFS SDPSNNDTVE RFYADFQMFT SGNLNPDPST
YMSNFTCQSI PQKANNWSGN NYARYCNPEY DKLWKEATQE LDAKKRQELF IKMNDLLVNN
FVLIPLVHRA DVAGISNRLQ GFELTPWDFN TWKIKDWKKS