Gene Syncc9902_1931 details


Gene Information

Locus tag: Syncc9902_1931
Symbol:
ID: 3743811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9902
Kingdom: Bacteria
Replicon accession: NC_007513
Strand:
Start bp: 1844798
End bp: 1846318
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 58%
IMG OID: 637772126
Product: anthranilate synthase
Protein accession: YP_377932
Protein GI: 78185497
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID:
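
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    gene_len = span_length(1844798, 1846318)
    print(gene_len)                           # 1521
    print(expected_protein_length(gene_len))  # 506
```

Under these assumptions, the result agrees with the listed gene length of 1521 bp and protein length of 506 aa.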


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0736308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAGTC CTGATCGCGA CGCCTTTCGC AAGGCTGCAG CCAACGGCGC CAATCTGATT 
CCCTTGGCCC AAAGTTGGCC AGCAGATCTG GAAACGCCTC TCACCACCTG GATCAAGGTT
GGAGCGAACC ATCCACCTGG TGTGTTGCTG GAATCCGTTG AAGGGGGTGA AACCCTGGGG
CGATGGAGTG TGATCGCCTG TGATCCCCTA TGGACAGCTT CGGCAAGGGG TAGCCAACTC
GTCCGACGGT GGAGAGATGG ACGTGAAGAC AGTCTGGAAG GGAACCCACT CCAATCCCTA
CGCACTTGGC TTAATCCTTA CGAGTGCGTC AATTTGCCTG GGCTGCCGCC CCTTGGGCAG
ATCTACGGAA TGTGGGGCTA TGAACTCATC CAATGGATTG AACCCACCGT CCCCGTTCAC
CCCAGGCTGA GCAGCGATCC TCCCGATGGG ATCTGGATGA TGATGGACGC AATCCTGATC
TTCGATCAGG TAAAGCGGCA AATCACAGCC GTGGCCTTTG GTGACCTCAC AAGCGGGGCC
ACAGAAGATG AGGCCTGGCA TAGCGCCTTA GCCCGGATCG AAGGATTGCG GGAGCGCATG
AACGCTCCAC TCCCTGCGGT GGATCCCTTG CAATGGGATC CCAATGCTCG CGATCTGCCT
GATGTCTCGA GCAATCGCAG CCAAGCTGAA TTTGAAACGG CCGTTCGCAC CGCCAAAAGC
CATATTGCTG CTGGAGATGT TTTCCAGCTG GTGATCAGCC AGCGACTCGA GGCCCAGGTC
CCCCAGAGCC CACTGGAGCT CTATCGCAGC CTGCGGATGG TGAACCCCTC GCCTTACATG
GCCTTCTTCG ATTTCGGTGA TTGGCAGCTC ATCGGATCCA GCCCTGAGGT GATGGTGCAA
GCGGAACCAG CCGCCAACGG CATTCAGGCC AGCTTGCGTC CCATCGCTGG GACGCGACCC
CGCGGACAAA CCCCACTGGA AGACCGAGAA CTCGAAGTGG ACCTGCTCGC CGACCCAAAA
GAACGTGCCG AGCATGTGAT GCTCGTTGAT CTCGGGCGCA ATGATCTCGG GCGCGTTTGC
TTGCCCGGAA GTGTTTCCGT CAAAGACCTA ATGGTGATAG AGCGTTACTC CCACGTGATG
CACATCGTGA GTGAAGTGGA AGGATGCCTT GCTCCACACC ATGACGTTTG GGATCTGCTC
ATGGCCTCAT TCCCAGCAGG AACAGTGAGC GGTGCTCCAA AAATTCGAGC GATGCAACTC
ATCCATGCTC TGGAACCGGA TGCAAGAGGT CCCTACTCAG GGGTGTATGG CTCTGTTGAT
TTGGCCGGCG CGTTGAATAC GGCCATCACG ATCCGCACGA TGGTGGTGCA ACCCAACGGC
AGCGGAGGAT GTCGTGTGAA GGTCCAAGCT GGGGCTGGCG TCGTGGCCGA TTCCCAGCCA
ACGGCCGAAT ACGAAGAAAC CCTCAACAAA GCCCGCGCGA TGCTCACCGC CTTGGCTTGT
TTAAACCCGG CGGAATCATG A
 
Protein sequence
MFSPDRDAFR KAAANGANLI PLAQSWPADL ETPLTTWIKV GANHPPGVLL ESVEGGETLG 
RWSVIACDPL WTASARGSQL VRRWRDGRED SLEGNPLQSL RTWLNPYECV NLPGLPPLGQ
IYGMWGYELI QWIEPTVPVH PRLSSDPPDG IWMMMDAILI FDQVKRQITA VAFGDLTSGA
TEDEAWHSAL ARIEGLRERM NAPLPAVDPL QWDPNARDLP DVSSNRSQAE FETAVRTAKS
HIAAGDVFQL VISQRLEAQV PQSPLELYRS LRMVNPSPYM AFFDFGDWQL IGSSPEVMVQ
AEPAANGIQA SLRPIAGTRP RGQTPLEDRE LEVDLLADPK ERAEHVMLVD LGRNDLGRVC
LPGSVSVKDL MVIERYSHVM HIVSEVEGCL APHHDVWDLL MASFPAGTVS GAPKIRAMQL
IHALEPDARG PYSGVYGSVD LAGALNTAIT IRTMVVQPNG SGGCRVKVQA GAGVVADSQP
TAEYEETLNK ARAMLTALAC LNPAES
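
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the tool the database used: it strips the spaces and line breaks from the 10-base blocks, translates in frame 1 until the first stop codon, and computes the GC fraction. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon assignments are used here. The function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon or incomplete codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_block = "ATGTTCAGTC CTGATCGCGA CGCCTTTCGC"
    print(translate(first_block))  # MFSPDRDAFR
```

Pasting the full gene sequence into `translate` and `gc_fraction` allows the listed protein sequence and GC content to be compared against the computed values.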