Gene Cyan8802_2086 details


Gene Information

Locus tag: Cyan8802_2086
Symbol:
ID: 8391403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 2098646
End bp: 2099737
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 44%
IMG OID: 644980065
Product: extracellular solute-binding protein family 3
Protein accession: YP_003137809
Protein GI: 257059921
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID:
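
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2098646
end_bp = 2099737

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1092
print(remainder)       # 0
print(protein_length)  # 363
```

Under these assumptions the result matches the listed Gene Length (1092 bp) and Protein Length (363 aa).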


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.256894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAATC GAACCTGGTC TATGACTAAA CAACCGTTTT TAGTTCTAGC AACGCTCTTG 
TTACTGTCTC CTTTAACCGC TTGTGGTGGA GGACAACCGA CAACCGAAAC GACCCCCGCA
CAAGAGAGTT CATCAAAAGT CAGTGGGAGT CGTTTAGCGA CGATCAAAGA ACGGGGAACC
CTCATTTGTG GGGTTAACGG AGAAGTCCCT GGATTTAGCT TTGTTGATGA ACAAGGCCAA
TATTCTGGGT TAGATGTGGA TATGTGTCGG GCGATCGCGG CTGCTTTATT TGATGACCCC
TCTAAGGTTG AATATCGCAA ACTCAGTGCC CAAGAACGCT TAACGGCTGT TCAGTCCGGC
GAAGTGGACG TTCTTAACCG TAATACCACC TGGACGATGA GTCGTGATAC TGCCGTGGGA
ATGGAATTTG CTCCTACAGT TTTCTATGAT GGTCAAGGAA TCATGGCAAC TAAAGCCAGT
GGAGCGAATA CATTAAAAGA TTTAACGGGT AAATCGATTT GTGTCCTAGC AGGAACCACA
ACGGAACAAA ATTTAGCCGA TCAGATGCGT AAAGAAGGGG TAACGGATTA TAATCCCGTC
GTTTCCGATG ATGTGGATGC GCTCTATGCA GCCTATCAAG AAGGTCGCTG TGAGGCGGTT
ACGTCTGATC GCTCGCAATT AGTCGCTCGT CGTTCTATTT TCCCCAAAAA AGACGATCAT
GTCATCTTAG ATGTGGTTAT GTCTAAAGAA CCTTTAGGAC CTGTGGTAGC TGATGGGGAC
TCCACTTGGT ATGATGCCGT TAAATGGATT ACTTTTGCCG TTATTCAAGC CGAAGAATTT
GGCATTACTT CCCAAAATTT AGCCACCTTT GAATCGACTG AAGATCCTAA TATTAAACGA
TTTTTAGGAA TCGATGATAA ATTAGGCGAA GACATGGGAT TACCGAACGA TTTCGCCGCT
CGTATTATTA AGCACGTTGG TAATTATGGA GAAATTTATG AGCGTAACAT CGGTAAACCG
TTAGGATTAG AACGGGGTCA AAATCAACTT TGGACTAATG GCGGTTTACT TTATTCTCCT
CCTTTTCGAT AG
 
Protein sequence
MFNRTWSMTK QPFLVLATLL LLSPLTACGG GQPTTETTPA QESSSKVSGS RLATIKERGT 
LICGVNGEVP GFSFVDEQGQ YSGLDVDMCR AIAAALFDDP SKVEYRKLSA QERLTAVQSG
EVDVLNRNTT WTMSRDTAVG MEFAPTVFYD GQGIMATKAS GANTLKDLTG KSICVLAGTT
TEQNLADQMR KEGVTDYNPV VSDDVDALYA AYQEGRCEAV TSDRSQLVAR RSIFPKKDDH
VILDVVMSKE PLGPVVADGD STWYDAVKWI TFAVIQAEEF GITSQNLATF ESTEDPNIKR
FLGIDDKLGE DMGLPNDFAA RIIKHVGNYG EIYERNIGKP LGLERGQNQL WTNGGLLYSP
PFR
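
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, not part of any database tool, and only the first and last blocks of the gene sequence are used here; pasting the full sequence into `translate` or `gc_percent` works the same way.

```python
# Codon assignments shared by the standard code and translation table 11,
# listed with the first, second and third base each cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks and the final two blocks of the gene sequence above.
first_blocks = "ATGTTTAATC GAACCTGGTC TATGACTAAA"
last_blocks = "CCTTTTCGAT AG"

print(translate(first_blocks))  # MFNRTWSMTK
print(translate(last_blocks))   # PFR
```

The two printed fragments correspond to the first ten and last three residues of the protein sequence listed above; the closing TAG codon is a stop codon and yields no residue.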