Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2270 |
Symbol | |
ID | 7102502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 2340816 |
End bp | 2342558 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643475316 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002372445 |
Protein GI | 218247074 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAGT CAGTCAACAT TCGCTCTTGT ATTCTACCTG TTTTACTGTC TGTCGGGTCA TCTTTTCTAC TCTGTGCTTG TAACCCTCAA CCAGAAACCC CAACCTCTCA GACAACTTCC CAAAGCGAAA ATGAAACTTT AAAATTATTA TATTGGCAAG CTCCCACCAT TTTGAATCCT CATCTTTCCA CTGGGTTTAA AGATAGTGAG GCAAGTCGAA TTACCTTAGA ACCCTTAGCC AGTTATGATA ACAAAGGCCA GTTAATTCTG TTTTTAGCGG CGGAAATTCC CTCAGTAGAG AATGGAGGAA TAGCCAAGGA TGGTAAATCA GTTATTTGGA AGTTAAAACA GGGGATAAAA TGGTCAGATG GAACACCATT TACAGCAGCC GATGTTGTGT TTACCCATCA GTTTATTGCT AATCCTAAAG TTGGAGCTAC CAGTGGGAAT AGTTATTTGA ATGTCGAAAA AGTAGAAGCT TTAGATGATT ATACTGTAAA AGTGATTTTT AAACAACCGA CTCCGTCTTG GGATATCCCT TTTGTCGGGG GTGCAGGGAT GATTTTACCC CGTCATCTCT ATGAAAAGTA TAACGGAGAA AATGCACGAC AAGCTCCTAA TAATTTAATC GCTGTGGGAA CAGGACCTTA TAAAGTTGTG GACTTTAAAC CAGGAGATGT GGTGGTTTTT GAGGCTAATT CCTATTTCCG AGAAGCGGAT AAATTAGGGT TTAAACGCAT TGAATTGAAA GGGGGAGGAG ATGCTACCTC TGCTGCGAGA GCAGTGTTAG AAACGGGGGA TGCAGACTAT GCTTATAATT TGCAGGTAGA AGTCCCTGTA TTAAAGCAAT TAGAAGCAGC AGGAAAGGGC AAATTAAACT CAGTTTTTGG AGGCAATAGT GAAAGAATTT TAATCAATTT AAGTGATCCT AATAAAGCAA CAACCGAGGG AGAAAGATCT AGTCTACAAT TCCCCCATCC CTTGTTTAAA GATCCCAAAG TCAGAGAAGC TTTTACTTTA GCGGTTGATC GAGATACTGT GGCTCAACAA TTGTATGGAA TTACGGGAAA AGCAACGCCT AATGTCTTAG TTTCTCCTCC TGAATATAAC TCTCCTAATA CGAAATATGA ATTTAATTTA GAGAAGGCAG CTAAGTTATT AGATGAGGCT GGCTGGAAGG ATTCTAATAA TAATGGTATT CGAGACAAGG ATGGGGTAGA AATGCAAATT CTTTTCCAAA CATCCGTGAA TCCTTTGCGT CAGAAAACCC AAGAAATCAT CAAACAAAGC TTACAACAAA TTGGAGTTGG TGTTGAATTA AAAAGTATTG ATGCAAGTAT TTTCTTTTCG AGTGATCCGT CTAATAATGA TACCGTTGAA CGTTTCTATG CCGATTTTCA AATGTTCACC TCTGGTAATT TAAACCCCGA TCCCAGTACC TACATGAGTA ATTTTACCTG TCAATCCATT CCCCAAAAAG CGAATAATTG GTCAGGGAAT AATTACGCTC GCTATTGTAA CCCCGAATAT GATAAATTAT GGAAAGAAGC CACCCAAGAA TTAGATGCCA AAAAACGCCA AGAACTCTTT ATTAAAATGA ATGATCTGTT AGTCAACAAT TTTGTTCTTA TTCCCTTAGT CCATCGTGCT GATGTAGCAG GAATTAGTAA TCGTTTACAA GGGTTCGAGT TGACTCCGTG GGACTTTAAT ACTTGGAAGA TTAAAGATTG GAAAAAATCT TAA
|
Protein sequence | MGKSVNIRSC ILPVLLSVGS SFLLCACNPQ PETPTSQTTS QSENETLKLL YWQAPTILNP HLSTGFKDSE ASRITLEPLA SYDNKGQLIL FLAAEIPSVE NGGIAKDGKS VIWKLKQGIK WSDGTPFTAA DVVFTHQFIA NPKVGATSGN SYLNVEKVEA LDDYTVKVIF KQPTPSWDIP FVGGAGMILP RHLYEKYNGE NARQAPNNLI AVGTGPYKVV DFKPGDVVVF EANSYFREAD KLGFKRIELK GGGDATSAAR AVLETGDADY AYNLQVEVPV LKQLEAAGKG KLNSVFGGNS ERILINLSDP NKATTEGERS SLQFPHPLFK DPKVREAFTL AVDRDTVAQQ LYGITGKATP NVLVSPPEYN SPNTKYEFNL EKAAKLLDEA GWKDSNNNGI RDKDGVEMQI LFQTSVNPLR QKTQEIIKQS LQQIGVGVEL KSIDASIFFS SDPSNNDTVE RFYADFQMFT SGNLNPDPST YMSNFTCQSI PQKANNWSGN NYARYCNPEY DKLWKEATQE LDAKKRQELF IKMNDLLVNN FVLIPLVHRA DVAGISNRLQ GFELTPWDFN TWKIKDWKKS
|
| |