Gene PCC8801_3676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3676 
Symbol 
ID7102926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3842877 
End bp3844280 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content47% 
IMG OID643476690 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_002373793 
Protein GI218248422 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACA AAAATTGGAC TAGACGGCAA GCCCTATTAG GGTTAGGGGG TCTTGCGGGG 
GCTGTGGCCT TTTCTTCGTG TGGCATTAAT ACAAACCGCG CCCCGAAAAG CCTGACAGAA
GCTGCCCTTG CCGTTGATCA AGTCGTTAAA CCCGAAACCC TTGAAAAACC CAATCTCAAA
ATTGGTTATG TTCCCGTCAA TGACTGTGCC CCCTTTGCCA TTGCTTGGGA AAAAGGCTTC
TTTCACAAGT ATGGTTTAAA CGTCACCCTC AGTCGTGAGG CTAGTTGGGC AAACTCCCGT
GATGGGGTAA TCTTCGGTCG TTTGGATGCG TCTCCAGTTG TTTCTGGGGC GGTTACTAAT
GCTAGAATTG GGGCTGAAGG AGCCCGTCAC GCTCCCTTGT GTGCAGCCAT GACCATTCAT
CGTCACGGCA ACGCCATGAC CATGAATCAA GGACTGTGGG ATGGAGGGAT TCGTCCTTGG
AAAGAATATA AAGGGGATTT AGACGCATTT GGTCGAGATT TTAAGGACTA TTTTGCAAAA
GCTCCGTCAG ACAAACGGGT TTGGGCGGTA GTACTCAGTT CAGCTATTTA CGAATACTTT
ACCCGTTATG TAGTGGCAGC CGCTGGACTC AATCCTACTG AGGAATTTCG GATCATTATC
ACTCCCCCTC CGCAAATGGT CAGTAATATG CGAATTGGCG CGATGCAAGC TTATATGGTG
GCCGAACCCT GGAATACTCG CGCTATTTCG GGGAACGAGG GGATAGGCTT TACCTTTGCC
CAAGGACGAG AAATCTGGCG GGGACATCCT GACCGAGTGT TGGCGGTAAC GGAGTCCTTT
ATCGAGGAAA ATCCCAAAAC CTATCGATCG CTGGTGAAAG CCTTAATTGA AGCCTGTCAG
TATTGCAGTA AGCCCGAAAA CCGCGAAGAA GTGGCTAAAA TTATCTCGAC TCGTCCCTTT
ACGGGGGCAA AACCTCAATA TACACGACCT GGAATAGTTG GAGATTACAA CTACGGAGGA
TTTGATGAGC AAAAACGGGT GGTTAATAGT CCAGAAACGA CGATTTTTTA CAATCTTCCT
GAGGGAGTTT CTGCTGTTCC TCACGATCAT TCGACTTTTC TCTGGCAATC TCAAAGTCTC
TGGTTAATGA CCCAAGCCAC TCGATGGCAA CAGATCCGAG AATTTCCCAA AAATGCTGAA
AAAATTGCTC GTCAGGGTTG GAAAACGGAT TTGTATCGAG AAATTGCTGC TGAAATGGGG
ATTAAATGTC CTTCCCAGGA TTACAAAGTT GAACCCGCCG AGGCTTTTAT TGATAATAAA
GCCTTTGATC CGAGTGATCC GATTAACTAT CTCAACAGCT TTGAAATTCG CGCCAACGCG
CCTCAATCTT TCTTCATGTC TTAA
 
Protein sequence
MNNKNWTRRQ ALLGLGGLAG AVAFSSCGIN TNRAPKSLTE AALAVDQVVK PETLEKPNLK 
IGYVPVNDCA PFAIAWEKGF FHKYGLNVTL SREASWANSR DGVIFGRLDA SPVVSGAVTN
ARIGAEGARH APLCAAMTIH RHGNAMTMNQ GLWDGGIRPW KEYKGDLDAF GRDFKDYFAK
APSDKRVWAV VLSSAIYEYF TRYVVAAAGL NPTEEFRIII TPPPQMVSNM RIGAMQAYMV
AEPWNTRAIS GNEGIGFTFA QGREIWRGHP DRVLAVTESF IEENPKTYRS LVKALIEACQ
YCSKPENREE VAKIISTRPF TGAKPQYTRP GIVGDYNYGG FDEQKRVVNS PETTIFYNLP
EGVSAVPHDH STFLWQSQSL WLMTQATRWQ QIREFPKNAE KIARQGWKTD LYREIAAEMG
IKCPSQDYKV EPAEAFIDNK AFDPSDPINY LNSFEIRANA PQSFFMS