Gene Syncc9605_2603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_2603 
Symbol 
ID3735313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp2417349 
End bp2418647 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID637777189 
Productanthranilate synthase 
Protein accessionYP_382885 
Protein GI78214106 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0445726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0808647 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCC CACACCGGCT TGAGCTGCCC TGGCAGGAGC CCCAATCCCT CGCGCACCAA 
CTGGCCCATG CCTACGGCGA GGAAGGGATG GTCTGGCTGG ATGGCGACGG CAGCAGCCTG
GGCCGACGGG CCACCCTGGC AGTGGCACCC CAGGAGATCA TCTGCTGCCG CGGTCTCCCC
GACGAGCCAG GAGCCAGCAA TCCCTTCGAG GCACTGCGGG GACTGGCCCC AGGGCATTGG
TGCGGCTGGC TGAGCTATGA AGCCGCCGCC TGGGTGGAAC CGGGAAACCC CTGGGCCAGC
GACGGCATGG CCACGCTGTG GATCGCCCGC CACGATCCGG TGCTTCGCTT TGATCTGCAA
AAGCGCAGGC TGTGGATCGA AGCCAGCAGC ACGGCTGCTC TGGACCGCCT CACCCAACAG
CTGGCCTCCG TCACTGAGCA GCCCAAAGGC AAGCCCCCAT CCATCCCCCT GACGGCCTGG
CATCACCACA CCTCAGCAGA TCACTACGCC GCAGGTGTGC AGCGCATCCG TGATCTGATC
GCGGCAGGCG ATCTCTTCCA AGCCAATCTC ACGGCTTGTT GCAGCACAGC TTGGCCCCAG
GGAGGCAATG CCCTCGAGCT GTTTGTCACC CTTAGGGAAG CCTGCCCTGC TCCCTTTGCA
GGGCTGATCA TCAGCGACCA AAACGAGGCG TTGTTGTCAT CGTCCCCGGA GCGGTTTCTG
CAGGTGAGTG CCGAGGGAGC TGTACAAACC CGGCCGATCA AAGGCACCAG GCCTCGCCAT
GGCGACCCCG AACAGGATGC GAATCTCGCC ACGGAACTCG TGTGCAGCGA TAAGGACCGG
GCCGAGAACG TGATGATCGT CGACCTGCTG CGGAATGACC TCGGTCGTGC CTGCCAGCCG
GGTTCGATCC AGGTTCCCCA ACTGGTGGGG CTCGAAAGTT ACGCCTCCGT GCATCACCTC
ACCTCGGTCG TGGAGGGACA GCTGCAGGCC GGGTTGAGCT GGGTCGATCT CCTGGAAGCC
AGTTGGCCTG GGGGGTCGAT CAGCGGGGCG CCGAAACTGC GGGCCTGCCA ACGTTTGCAT
GAGCTCGAGC CCACCAGCCG AGGGCCTTAC TGCGGATCAC TGCTGCGGAT CGACTGGGAC
GGCAGCTTCG ACAGCAACAT CTTGATCCGA TCTTTACTGC GCCAAGGCGA CACCCTGCGG
GCCCATGCCG GCTGCGGAAT TGTCGCCGAC TCGGATCCCC TTGGCGAAGC AGAGGAGTTG
ATGTGGAAAC TGCAGCCATT GCTGGAGGCG CTGGCATGA
 
Protein sequence
MIRPHRLELP WQEPQSLAHQ LAHAYGEEGM VWLDGDGSSL GRRATLAVAP QEIICCRGLP 
DEPGASNPFE ALRGLAPGHW CGWLSYEAAA WVEPGNPWAS DGMATLWIAR HDPVLRFDLQ
KRRLWIEASS TAALDRLTQQ LASVTEQPKG KPPSIPLTAW HHHTSADHYA AGVQRIRDLI
AAGDLFQANL TACCSTAWPQ GGNALELFVT LREACPAPFA GLIISDQNEA LLSSSPERFL
QVSAEGAVQT RPIKGTRPRH GDPEQDANLA TELVCSDKDR AENVMIVDLL RNDLGRACQP
GSIQVPQLVG LESYASVHHL TSVVEGQLQA GLSWVDLLEA SWPGGSISGA PKLRACQRLH
ELEPTSRGPY CGSLLRIDWD GSFDSNILIR SLLRQGDTLR AHAGCGIVAD SDPLGEAEEL
MWKLQPLLEA LA