Gene PCC8801_0377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0377 
Symbol 
ID7103340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp377601 
End bp379124 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content45% 
IMG OID643473487 
Productanthranilate synthase component I 
Protein accessionYP_002370631 
Protein GI218245260 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTTC CCGATTTCAA GCAATTTTGC GCTTTGGCAC AACAGGGGAA TTTTGTCCCC 
GTGTATCAAG AATGGGTTGC CGACCTAGAA ACCCCGGTTT CTTCTTGGTA TAAGGTCTGT
GGTGATCAAC CCTACAGCTT TCTATTAGAG TCGGTGGAAG GGGGAGAAAA CCTCGGACGC
TATAGCTTTT TGGGATGTGA CCCCGTTTGG GTATTAGAAA CCAAGGGAGA GACGACGACT
CAAACCTATC GAGATGGTAG CATTAAGATA TTTCAGGGCA ATCCTTTGGA TATTTTGCCT
CAATGCTTGG CAACAATTCA TCCAGTCATG TTACCTCAAC TTCCCTCAGG AATTGGGGGA
TTATTTGGAT TTTGGGGCTA TGAGTTAATT CATTGGATTG AACCGCGAGT CCCCATCTAT
CCGTGTACCC AAGAGGACTT ACCCGATGGA ATCTGGATGC AGGTAGATAA TTTAATTATT
TTTGATCAGG TGAAGCGAAA AATTTGGGCG ATCGCCTATG CTGATTTACG AGGGGAAAAA
GTTGACCTCA AACAAGCCTA TCAACAAGCT TGCGATCGCG TTACTAAGTT AGTGATCAAG
CTACAACTTC CCTTACCAGT AGAAGCCAAA ACCCTAGAAT TAAACCCAAA ATCAGCCGAA
TCTGACCCAT TAAATTATAA TAGCAATATA GAGCGATCGC GCTTTTGTGA AAATGTCCTC
AAAGCCAAGG AATATATCCG TGCCGGGGAT ATCTTTCAAG TCGTGCTTTC TCAACGCCTG
ACAGCCCATT ATAGCGATGA TCCCTTCAAT CTTTATCGTT CCCTGCGGTT GATTAATCCG
TCTCCCTATA TGGCCTATTA CAATTTTGGA GACTGGCAAA TTATTGGGTC AAGTCCAGAA
GTAATGGTTA AGGCTGAACG GATAGAAGAA AAGAAAATTA AAGCAACCCT AAGACCCATC
GCGGGAACCC GAAAACGGGG TAAGACAGTG GCAGAAGATC AGGCATTAGC TCAGGATTTA
CTGCAAGATC CCAAGGAAAT CGCCGAGCAC GTCATGTTAG TGGACTTGGG AAGAAATGAT
TTAGGCCGGG TCTGCGTCGA AGGAACGGTT ACTATTGATG AGCTCATGGT GATTGAACGC
TACTCCCATG TTATGCACAT CGTCAGCAAT GCGATCGGAG AATTGTCCCC TGATAAAACG
GCCTGGGACT TATTAAAAGC CTGTTTTCCG GCAGGAACCG TCAGTGGTGC ACCCAAAATC
CGTGCCATGG AAATTATCCA TGAATTGGAA CCCGAACGAC GAGGCCCCTA TTCGGGGGTT
TACGGTTACT ACGATTTTGA GGGACAGCTA AATACAGCGA TCGCCATTCG GACTATGGTA
GTTCGTCCGT TAGGGGGCAA TCAACATCGG GTTTCGGTAC AAGCCGGAGC CGGGTTAGTA
GCAGATTCTG ACCCCGAAAA GGAATATGAA GAAACGTTAA ATAAAGCAAG GGGATTGTTA
GAAGCCATTC GTTGTTTAAG TTAA
 
Protein sequence
MIFPDFKQFC ALAQQGNFVP VYQEWVADLE TPVSSWYKVC GDQPYSFLLE SVEGGENLGR 
YSFLGCDPVW VLETKGETTT QTYRDGSIKI FQGNPLDILP QCLATIHPVM LPQLPSGIGG
LFGFWGYELI HWIEPRVPIY PCTQEDLPDG IWMQVDNLII FDQVKRKIWA IAYADLRGEK
VDLKQAYQQA CDRVTKLVIK LQLPLPVEAK TLELNPKSAE SDPLNYNSNI ERSRFCENVL
KAKEYIRAGD IFQVVLSQRL TAHYSDDPFN LYRSLRLINP SPYMAYYNFG DWQIIGSSPE
VMVKAERIEE KKIKATLRPI AGTRKRGKTV AEDQALAQDL LQDPKEIAEH VMLVDLGRND
LGRVCVEGTV TIDELMVIER YSHVMHIVSN AIGELSPDKT AWDLLKACFP AGTVSGAPKI
RAMEIIHELE PERRGPYSGV YGYYDFEGQL NTAIAIRTMV VRPLGGNQHR VSVQAGAGLV
ADSDPEKEYE ETLNKARGLL EAIRCLS