Gene Information |
Locus tag | Cyan8802_4328 |
Symbol | |
ID | 8393680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 4473519 |
End bp | 4475174 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644982238 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003139949 |
Protein GI | 257062061 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
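The coordinate and length fields in the gene table can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4473519, 4475174)
print(length)                     # 1656
print(protein_length_aa(length))  # 551
```

Under these assumptions the computed values agree with the Gene Length (1656 bp) and Protein Length (551 aa) fields.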
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.310509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0771259 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAACT GTTTTTCCTT AAATTTTAAA CAATTGGGGC GATTAGGACG GTTTTTTGCC CTGTTTAGTC TATGTTTTTT CTTAACGGTC GCTTGTAACC AACAAAAGAC TAGCGAACCT ATCCAGGGAA CGCCACCGAA TAGCGATCGC ATCACCATCG GAACCATTGC CAAACCCCGC ACCATCGATC CCGCCGATAG CTACGAATTA TCCGGCTTAA TCCTTATTTA TAACCTCACC GATACCCTTT ATAGCTACGA ACTGGGAACC ACAACCCTAA AGCCCCAATT AGCCGCCGAA ATGCCTAAAA TTAGTGCCGA TGGCTTAACC TATACCATCC CCCTCCGTCA AGGAGTCACC TTCCACGACG ACACTCCTTT TAACGCCGAA GCGATGGTAT TTTCCTTAGA ACGCTTCATG AAAAATGGCG GTAAACCCTC CTTCTTGTTA GCAGACACCA TCGACACGGT AAAAGCCACA GGAGACTATG AAATCACCAT TACCCTGAAA AAACCCTTTT CCGCCTTCCC TGCCCTATTA GCCTATCCTG GGGCTGCTGC GGTGTCTCCA AAAGCCTACG AAATTGGGGC AGGAAAGTTT CAACCCGATC GCTTAGTGGG AACCGGTCCT TACAAATTAG CAGCCTTTAG CAGTGATTCA GTGCAGTTAG AGGTCTTTGA CAAATACTGG GGAGAAAAAC CGAAAAACCA GGGAATTAAC CTACAAATTT ACCCCGATAA CCCCGCTAAT TTATTTAATG CCTTTAAAAC AAAAGCCGTT GATGTCGCCT ATCAATCTTT GTTAGCGCAA CAGATCAAAG CCCTCAAAGA ACAAGCCACT CAAGGACAGG GACAAGTTAT TGAAGCCCCA GGAACCGCGA TCGCCTTTAT GGCACTTAAT CTTAACAGTG ACTCCCTCAA AAATAAACCC GTTCGTCAAG CGATCGCCGC TTTAATGAAC CGTCAATTAC TCATAGATCG GGTTTTGCAA GGTCAAGGAG AACCCCTCTA CAGTATGATT CCAAACGCCT TTGAAGCCTC ACAACCCGTC TTTAAAGACC GTTATGAGGA TGCTAATAAA GAAGAAGCCC TAAAATTCCT GACAGAAGCG GGATATTCTG CCGAGAAGCC CGTCCCTGTT GAAATCTGGC ACACTTCTAG TTCCACCAAT GCCAGTCAGG TTGCTGCTAT TCTCAAAGAA TTAGGCAAAC GGGACTTAGG AGGAGTCATT GAATTTCAAC CCAATAGTAT TGCCTCAGCA GCCTTTTTTA AGAACCTTGC CCAAGGATTA TATCCCGCGA CTTTATCCAA TTGGTATCCC GACTTTTTGG ATGCCGATAA CTATATTTAT CCCTTCTTAC ATTGTGCCAA AGGCAGCCCA GAACAAGGGT GTAGTGAAGG AGGATCGCAA GCTCAAGGGT CATTTTATTA CAGCGATCGC ATCAATGAAT TAATCGATCA ACAACGTCGT GAAGCCAACC CCGAAAAACG TCAAGCGATC TTTAAAGAAA TTCAAACTAT TTTAGCCGAA GACGTTCCTT TTATTCCCCT GTGGCAAACC AAAGAATATG CCTTTGCTCA AAATAATATT AATGGGATCA CAATTAATCC TAGTCAAACT TTTCCTTTTT GGACAATTAG TCGGGGAACA AAGTAA
|
Protein sequence | MINCFSLNFK QLGRLGRFFA LFSLCFFLTV ACNQQKTSEP IQGTPPNSDR ITIGTIAKPR TIDPADSYEL SGLILIYNLT DTLYSYELGT TTLKPQLAAE MPKISADGLT YTIPLRQGVT FHDDTPFNAE AMVFSLERFM KNGGKPSFLL ADTIDTVKAT GDYEITITLK KPFSAFPALL AYPGAAAVSP KAYEIGAGKF QPDRLVGTGP YKLAAFSSDS VQLEVFDKYW GEKPKNQGIN LQIYPDNPAN LFNAFKTKAV DVAYQSLLAQ QIKALKEQAT QGQGQVIEAP GTAIAFMALN LNSDSLKNKP VRQAIAALMN RQLLIDRVLQ GQGEPLYSMI PNAFEASQPV FKDRYEDANK EEALKFLTEA GYSAEKPVPV EIWHTSSSTN ASQVAAILKE LGKRDLGGVI EFQPNSIASA AFFKNLAQGL YPATLSNWYP DFLDADNYIY PFLHCAKGSP EQGCSEGGSQ AQGSFYYSDR INELIDQQRR EANPEKRQAI FKEIQTILAE DVPFIPLWQT KEYAFAQNNI NGITINPSQT FPFWTISRGT K
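The sequences above are printed in blocks of 10 characters, so the spaces must be removed before any analysis. The following is a minimal sketch, not the database's own tooling, for computing GC content and translating the gene sequence. It assumes the listed gene sequence is the coding strand (it begins with ATG and ends with TAA), and it relies on translation table 11 assigning the same amino acid to each codon as the standard code. Table 11 differs in which codons may act as start codons, which this sketch does not handle. All names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATTAACT GTTTTTCCTT AAATTTTAAA"))  # MINCFSLNFK
```

The output for the first 30 bases matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.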