Gene Cyan8802_0366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_0366 
Symbol 
ID8389671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp352455 
End bp354278 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content36% 
IMG OID644978405 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003136162 
Protein GI257058274 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACTTA TTAAACCCCT TGAAGAATTA AAAATTTTGT CAATTCTGTT ATCTTTGTTG 
ACCAAAACTT CTGAATTGTA CAAGAGATCT ATTTTGAGAC GAACTTTAGC TTTTCTTTTG
GCCTGTTTCT GTTTATTGCC TTTAGTTGGT TGCCAGGCAA ACGCTAATAC GAACCAAGTG
GTTTTTGCTG TTTTAAGTGA TCCTAAAACC TTTAATGCAG TCCTTTCAGC AGAGTCTCCT
AATATTTTTC CGTTGACTTA TGAAGGGTTA ATTACCGAAA ATCCTTTAAC GGGAATTAAA
GAACCTTCTT TAGCAGAGTC TTGGGAATTT TCTGAGGATA AATTAACCAT TATTTTTACG
TTACGAGAAG GCTTAAAATG GTCTGATGGA CAACCTTTAA CGGCTGATGA TGTCGTTTTT
AGTTACAATG ATTTATACCT TAATCCGAAA ATTCCAAATA ATTATCGAGA TAGTTTTAGA
GTCGGTCGAA GTGGAACTTT TCCAGAGATT AAAAAGCTTG ATGATAGACG AATACAATTT
AAAATTACTG AACCCTTTGC CCCCTTTTTA GATGGGGCAG AAGTGCCCAT TTTACCAGCC
CATATTTTAC GCAAAACCTT AGAAAAAAAG AATAAGGACG GAAATCCTGA ATTTTTATCC
ACTTGGGGAA CCGATACTCC TCCCAAAGAT ATTATTGTTA ACGGACCCTA TAAACTCAAA
GACTATGTTA CCAGTCAGCG AATTATTTTT GAGAAAAATC CCTATTATTG GAAAAAAGAT
GAAACAGGAA AACAATTGCC GAATATTGAA CAAATCATTT GGGCAATTGT AGAATCAACG
GATACTTCTC TATTGCAGTT TCGGTCAGGG AGTTTAGATT CTGTTAGTAT TACCCCTGAA
TATTTCTCCT TATTAAAACG GGAAGAAGAC CGAGGTAATT TTACCATTTA TAATGGAGGA
CCCGCCTACG GTACAGTTTT TATTTCCTTT AATCTCAATA AAGGAAAACG GGATGGAAAA
CCCTTAGTTG ATCCCCTAAA ATCAGAATGG TTTAATAATT TAAACTTTAG AAAAGCGGTT
GCCTATGGCA TTGATCGCCC TCGGATCATT AATAATATTT ATCGAGGGTT AGGCGGTCTA
CAAAATACTC AAATTTCAGT GCAATCTCCC TATTATGATA AAACCATTAA GGGTTATGAT
TTTAATATCG AAAAAGCCAA AGCATTACTC GAAAAAGAAG GGTTTAAATT GAATAGCAAA
GGAGAATTAT TAGATAAAAA TGGCAATCGA GTTCAATTTG GTTTAATTAC CAATGCTGGT
AATAAAATTC GGGAAGCAAT GGGAGCACAA ATTAAGGAAG ATTTAGGCAA GCTAGGAATG
CAGGTTGATT TTACGCCCCT AGACTTTAAT ACCTTAGTAG GAAAGCTAAG TAATACGTTA
GATTGGGATG CCCATATTCT CGGTTTTACG GGAGGAAATG AACCCCACGC GCCGAATATT
TGGTATACCG ATGGTAATCT ACATATGTTT AATCAACAAC CCCAACCTGG ACAAAAACCC
ATTACAGGAT GGGTTGCTGC TGACTGGGAG AAAGAAATTG AACAAATTTA TGTAGAAGGT
TCCCAGGAAG TAGACCAACA AAAACGCAGA GAAATCTATA ATAAAGCACA GCAATTAGTG
TCAGATCATT TACCCTTTAT CTATTTAGTT AATCCCTATT CTTTGTCCGC AGTGAGAAAC
CGTTTTGAAG GGATTCAATA TTCTGCTTTA GGAGGTGCAT TTTGGAATAT TGAAAAAATT
TCAGTTTCAG AAAAGAATCA ATAA
 
Protein sequence
MVLIKPLEEL KILSILLSLL TKTSELYKRS ILRRTLAFLL ACFCLLPLVG CQANANTNQV 
VFAVLSDPKT FNAVLSAESP NIFPLTYEGL ITENPLTGIK EPSLAESWEF SEDKLTIIFT
LREGLKWSDG QPLTADDVVF SYNDLYLNPK IPNNYRDSFR VGRSGTFPEI KKLDDRRIQF
KITEPFAPFL DGAEVPILPA HILRKTLEKK NKDGNPEFLS TWGTDTPPKD IIVNGPYKLK
DYVTSQRIIF EKNPYYWKKD ETGKQLPNIE QIIWAIVEST DTSLLQFRSG SLDSVSITPE
YFSLLKREED RGNFTIYNGG PAYGTVFISF NLNKGKRDGK PLVDPLKSEW FNNLNFRKAV
AYGIDRPRII NNIYRGLGGL QNTQISVQSP YYDKTIKGYD FNIEKAKALL EKEGFKLNSK
GELLDKNGNR VQFGLITNAG NKIREAMGAQ IKEDLGKLGM QVDFTPLDFN TLVGKLSNTL
DWDAHILGFT GGNEPHAPNI WYTDGNLHMF NQQPQPGQKP ITGWVAADWE KEIEQIYVEG
SQEVDQQKRR EIYNKAQQLV SDHLPFIYLV NPYSLSAVRN RFEGIQYSAL GGAFWNIEKI
SVSEKNQ