Gene Syncc9902_2289 details

Gene Information

Locus tag: Syncc9902_2289
Symbol:
ID: 3743443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9902
Kingdom: Bacteria
Replicon accession: NC_007513
Strand:
Start bp: 2195974
End bp: 2197044
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 58%
IMG OID: 637772489
Product: hypothetical protein
Protein accession: YP_378290
Protein GI: 78185856
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID:
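
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are made up for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2195974
end_bp = 2197044
gene_length_bp = 1071
protein_length_aa = 356

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)
```

Under those assumptions the span equals the listed gene length, divides evenly into codons, and leaves 356 residues once the stop codon is set aside.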


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACGATCG CAGAGGGTTC CGGCAAAGAA CGGTTCAAAA ACCACCTGCG CAAGGTGGGC 
AGCGGTGAAC ACACCAGCAA AGGTCTGAGT CGTGAAGAAG CTGCAGAAGC TCTCGATCTG
ATGCTTCAGC AGGAAGCAAC CCCCGCTCAA ATCGGCGCTT TTCTGATTGC CCATCGCATC
CGTCGCCCTG AGCCGCAGGA GCTCACGGGA ATGCTGGACA CGTATCGACA CCTTGGGCCG
AAACTCCAAT CCAAAGCTGG CCAGACTCCG CCGGTGTGCT TCTGCATGCC CTTCGATGGA
CGCACCCGGA CCGCCCCCAT CTATCCCCTC ACAACGTTGG TGCTTCTGGC GTTAGGTCAA
CCCGTGGTGC TCCAAGGCGG GAATCGCATG CCAATCAAAT ATGGCGTGAC CGCCATCGAT
CTGTTTCGAG AATTAGGCGT GGAGCTGAGC GGACTCCCCC TTGAAACAGT GCAAGACGGG
CTGCAAACCA ACGGCTTCGC GTTGGTGCAT CAACCCGATC ACTTCGCTAT CGCCGAGAGC
TTGATCACCT ATCGCGAAGA ATTGGGCAAA CGACCCCCCG TCGCCAGCCT TGAGCTGCTC
TGGACCCCTC ATCAAGGCGC ACACCTCTTG ATCAGCGGTT TTGTTCACCC TCCAACCGAA
AGTCGAGCAT GGGAAGCACT CCGCCTTGCG GAAGAAGCCC AGGTGGTCAC CGTCAAAGGC
TTAGAAGGCG GAACCGACCT GCCAATTGGG CGCGCCTGCA TCACCGCAAA GGTTGATGGC
GGTCACGCAC AACGTTTAAT TCTTCACCCG CGAGACCACG ACTGCTATGA AGCAGATCTG
GAATGGACCG ATCCTGCAAC CTGGGCCCAA CAGGCCCTTG AAGCGCTCAA CAACAGTGGT
CCCCTGCTCA GTGCCTTGCG TTGGAATGCT GGGGTCTATC TCTGGTTTGC TGGTCAGAGT
GCAACCCTCG AGGCTGGTCT TGAACGTGCG CAAGAAGCCC TCGAAGGCGG CACAGCGCTA
ACGGCCTTGC ATCAACTTCA GGCGTGGAGC AAGGCCTTGG CCATGCGATA G
 
Protein sequence
MTIAEGSGKE RFKNHLRKVG SGEHTSKGLS REEAAEALDL MLQQEATPAQ IGAFLIAHRI 
RRPEPQELTG MLDTYRHLGP KLQSKAGQTP PVCFCMPFDG RTRTAPIYPL TTLVLLALGQ
PVVLQGGNRM PIKYGVTAID LFRELGVELS GLPLETVQDG LQTNGFALVH QPDHFAIAES
LITYREELGK RPPVASLELL WTPHQGAHLL ISGFVHPPTE SRAWEALRLA EEAQVVTVKG
LEGGTDLPIG RACITAKVDG GHAQRLILHP RDHDCYEADL EWTDPATWAQ QALEALNNSG
PLLSALRWNA GVYLWFAGQS ATLEAGLERA QEALEGGTAL TALHQLQAWS KALAMR
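
The gene sequence begins with TTG, one of the alternative initiation codons of translation table 11, while the protein sequence begins with M. The sketch below shows one way to reproduce that relationship. It is a minimal illustration and not the database's own method. The function name translate_cds and its is_start flag are hypothetical, and reading the first codon as methionine whenever is_start is true is a simplifying assumption. Only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11 in NCBI order (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    Hypothetical helper: when is_start is true, the first codon is read as
    Met regardless of its usual meaning (a simplifying assumption).
    """
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if is_start and protein:
        protein[0] = "M"
    return "".join(protein).split("*")[0]


# First and last rows of the gene sequence, copied from the record above.
first_row = "TTGACGATCG CAGAGGGTTC CGGCAAAGAA CGGTTCAAAA ACCACCTGCG CAAGGTGGGC"
last_row = "ACGGCCTTGC ATCAACTTCA GGCGTGGAGC AAGGCCTTGG CCATGCGATA G"

print(translate_cds(first_row))
print(translate_cds(last_row, is_start=False))
```

The last row starts at base 1021, which is in frame because the 1020 bases before it make up a whole number of codons. That is why it can be translated on its own with is_start set to false.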