Gene PCC8801_1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1444 
Symbol 
ID7103647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1516043 
End bp1517095 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content43% 
IMG OID643474520 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002371657 
Protein GI218246286 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAAG TATCGAGAAG AGTGTTTTTA GGAACGGGTG CAGCAGCAGC AACAGTTGCC 
TTAAGTCCGT TAATTCATGT CAAAGAAAGT TCCGCCCAAA ACCGAGAGGT TAACGTCTAT
TCATCGCGTC ATTACAATAC CGATAGTCGC TTATATGAAA ACTTCACCCG TCAAACCGGA
ATTAAGGTTA ATTTAATTGA AGGAGAAGCC GATCCGTTAA TAGAAAGAAT CAAAAGTGAA
GGAAAAAATA GTAAGGCAGA TATCTTAATT ACTGTTGATG CAGGACGCTT ATGGAGAGCG
GATCAAGCAG GAATTTTTGC CCCTGTTAAC TCTAAGATTT TACAACAAAA AATCCCCGCT
TCTCTCAGAC ATCCTAAAGG GCATTGGTTC GGGTTTAGTA AGCGATTGCG CGTTATTATG
TATAGCAAAG CAAGGGTCAA TCCATCCCAA CTTTCAACCT ATGAAGATCT CGCTAATCCG
AAGTGGAAAG GAAAGGTCAT TACTCGTTCT TCTACTAATA TTTATAGCCA ATCTCTTTGT
AGTTGGATGA TCGCCGTTAA TGGACAAGGG GCAACGGAAA AATGGTGTCG AGGATTAGTG
GCTAATTTTG CCCGTTCTCC CCAAGGTAAT GATACTGCCC AAATTGAAGC ACTCGCAGCA
GGGGTAGCTG ATTTAGCCCT AGTTAATACC TATTATTTGG CGAATTTAAT CGATAGTAAA
GACGAGAAAA AACGGGCGAT TGGTCAACAA GTCGGGGTAT TTTTCCCCAA TCAAAAAGGA
CGGGGAACTC ACGTCAATAT CAGTGGCGGA GGTTTGGTCA AAACTGCCCC AAATCGCAAC
GCAGCCGTTA AATTCCTCGA ATATCTCGTC AGTCCTCAAG CACAAACTTT CTTTGCCCAA
GGAAACCTCG AATATCCCGT GGTTTCAGGG GTACAGATTG ATCCCGTTTT AGCGAAATTT
GGAAAATTTA AGTCTGATAT CGCCAGGGTA GACGATTATG GACTTAATTT GGCCAAGGCT
GTCCAGGTGA TGGATCGGGC GGGGTGGAAA TAG
 
Protein sequence
MTKVSRRVFL GTGAAAATVA LSPLIHVKES SAQNREVNVY SSRHYNTDSR LYENFTRQTG 
IKVNLIEGEA DPLIERIKSE GKNSKADILI TVDAGRLWRA DQAGIFAPVN SKILQQKIPA
SLRHPKGHWF GFSKRLRVIM YSKARVNPSQ LSTYEDLANP KWKGKVITRS STNIYSQSLC
SWMIAVNGQG ATEKWCRGLV ANFARSPQGN DTAQIEALAA GVADLALVNT YYLANLIDSK
DEKKRAIGQQ VGVFFPNQKG RGTHVNISGG GLVKTAPNRN AAVKFLEYLV SPQAQTFFAQ
GNLEYPVVSG VQIDPVLAKF GKFKSDIARV DDYGLNLAKA VQVMDRAGWK