Gene PCC8801_4307 details


Gene Information

Locus tag: PCC8801_4307
Symbol:
ID: 7102669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4526679
End bp: 4528004
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 38%
IMG OID: 643477287
Product: folate/biopterin transporter
Protein accession: YP_002374386
Protein GI: 218249015
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00788] folate/biopterin transporter
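
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single in-frame stop codon at the end of the CDS; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4526679, 4528004)
print(length)                  # 1326
print(protein_length(length))  # 441
```

Both values agree with the Gene Length and Protein Length fields of this record.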


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCTAG CCAAGTATCC TATCAGTCAA ATTAAAAACC GCCTAAAAAA TAGCCTACTT 
TTAGGCAATG AACCTAGCTT AGAACTCGTC GCTATTTTAA GTGTATATTT TGTCCAAGGT
ATCCTTGGTT TAGCCCGTTT AGCTATCAGT TTCTTTCTCA AAGATGATCT CAATTTAACG
CCAGCACAAG TCGGGGCATT AACGGGAATT GCTGCCTTAC CTTGGATCAT TAAACCCCTT
TTTGGCTTTC TTTCTGATGG TTTACCAATC TTTGGTTATC GTCGTCGTCC CTATTTAATC
TTATCAGGAC TATTAGGAAG CCTAGCCTGG ATGATGCTAG GAACGATTGT TGATAACCCT
TGGTCTGCTA CTGTCAGCCT TCTTTTAGCG TCCCTTTCTG TGGCTATTAG TGATGTCATT
GTAGACTCCT TAGTGGTAGA AAGAAGCCGT CAAGAATCTT TAGGTCAAGC GGGTTCTCTA
CAATCATTAA CTTGGGGAAT ATCAGCTTTA GGAGGATTAA TTACCGCCTA TCTCAGTGGT
TGGTTACTAC AAGAATTTAC ACCTCAAACA GTGTTTCAAA TTACGGCTAT TTTTCCGTTA
ATTGTTGCCT TAGTAGCTGG TTTAATTGTT GAGGAACATA TCGCACAGAA TGCCTCTCAA
ATGCCCTCGA ATAAATCCTT TAAAATTAGT CAACAACTAC AGCAATTATG GCAAGCTATC
CGTCAAAAAT CGATTCTCTT GCCTACCGCT TTTGTCTTTA TTTGGCAAGC AACACCTAAT
GCTGAATCTG CCTTGTTTTT CTTTACCACG AATGAACTGG GATTTGAAGC AGAATTTTTA
GGACGGGTTA GACTTGTTAC CAGTGTCGCT ATGTTAGCGG GAATTTGGGT CTATCAAAGA
TTTCTAAAAA ATATTTCATT TCGTCAAATT TTTTCCTGTA GTATTGTTAT CTCTTCAGTC
TTAGGAATGA CCGTATTACT GTTAGTCACT CATACTAATC GTCTTCTTGG TATTGATGAT
CATTGGTTTA GTTTAGGGGA TAGTTTAATT CTCACCGTTA TGGGACAAAT TGCTTTTATG
CCTATCTTAG TTTTATCAGC GCGTCTGTGT CCTAAAGGCA TCGAAGCAAG TTTATTTGCC
CTGTTGATGT CTAGCTTAAA TTTATCCAAT TTGGTATCCT ATGAATTAGG CTCATTACTA
ACTCAATGGT TTGGTATAAC AGAAACTAAT TTTGATCATC TTTGGTTATT AGTGATTATT
ACTAATCTTT CTACTCTTCT ACCCTTATTT TTTATTAAGT GGCTACCTGA ATCTGATCCC
TTATAA
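
The gene sequence is laid out in space-separated blocks of 10 bases. A GC percentage such as the one in the GC content field could be computed along the following lines. This is a minimal sketch: `clean_sequence` and `gc_percent` are hypothetical helper names, and the rounding used by the source database is not known.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGACCCTAG CCAAGTATCC TATCAGTCAA ATTAAAAACC GCCTAAAAAA TAGCCTACTT"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))
```

Passing the full 1326 bp sequence to `gc_percent` gives the value to compare against the GC content field.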
 
Protein sequence
MTLAKYPISQ IKNRLKNSLL LGNEPSLELV AILSVYFVQG ILGLARLAIS FFLKDDLNLT 
PAQVGALTGI AALPWIIKPL FGFLSDGLPI FGYRRRPYLI LSGLLGSLAW MMLGTIVDNP
WSATVSLLLA SLSVAISDVI VDSLVVERSR QESLGQAGSL QSLTWGISAL GGLITAYLSG
WLLQEFTPQT VFQITAIFPL IVALVAGLIV EEHIAQNASQ MPSNKSFKIS QQLQQLWQAI
RQKSILLPTA FVFIWQATPN AESALFFFTT NELGFEAEFL GRVRLVTSVA MLAGIWVYQR
FLKNISFRQI FSCSIVISSV LGMTVLLLVT HTNRLLGIDD HWFSLGDSLI LTVMGQIAFM
PILVLSARLC PKGIEASLFA LLMSSLNLSN LVSYELGSLL TQWFGITETN FDHLWLLVII
TNLSTLLPLF FIKWLPESDP L
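
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. It does not handle alternative start codons, which is sufficient here because the gene starts with ATG. `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACCCTAG CCAAGTATCC TATCAGTCAA"))  # MTLAKYPISQ
```

The output matches the first 10 residues of the protein sequence. The final three codons of the gene, CCC TTA TAA, translate to P, L and stop, consistent with the C-terminal residues P and L.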