Gene Synpcc7942_0934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0934 
Symbol 
ID3775211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp940995 
End bp942056 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content59% 
IMG OID637799352 
Producthypothetical protein 
Protein accessionYP_399951 
Protein GI81299743 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.464361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC GATTCCGAGA CCTAATCCGT AAGGTTGGCA GTGGACGCCA CACGAGCCAA 
GTTTTGACGC AGGCTGAGGC CGCCGAAGCC CTGCAGCTCA TGCTGTCAGC CACGGCGACC
CCCGCCCAAA TTGGGGCTTT TTTAATTGCC CATCGCATTC GTCGCCCGAC GGGAACCGAA
CTCGCTGGAT TTTTAGAGAC CTACGCCGAC TGGTTGCCTG CTGTCTTGGC TCCGAGTACA
ACACGGCCGC CACTGGTCTT GGGCTATCCC TACGATGGCC GCGATCGCAC GGCTCCTTTG
GGACCACTGC TGGCGCTGCT GCTCGCCGCT GTTGGTCAAC CCGTTGTGCT GCACGGCAGC
GATCGCGTTG CAACAAAATA CGGTGTGCCT CTGGTTGAAC TCTGGGATGC GATCGGGGTG
AACTGGCGAT CGCGTTCTGT TGCTGATCTC AACCGCTGTC TCGAACAAGC TGGTGTAGCC
CAACTCCATC AACCCAGTCT TTGCCCGGCT GCTGAAGTGT TGAATGGCTA TCGCTCGGAA
TTGGGCAAAC GACCGCCGCT TGCCACTGCC GAATTGATGC TCGTTCCTGT CCAAGGGGCA
GCTTTGCCGG TTTGTGGCTT CGTCCATCCG CCGACGGAGT TGATGATCGA AGAAGCCCTG
AGTCTGCGCG GGATCACGAC CTTCTTCACC ATCAAAGGCT TGGAGGGGAG TCCAGAGCTA
CCGCGCGATC GCGCCGCAAT CGTGGGCCGC TGGCAGAATG GCCACTGCGA TCGTCTGATC
TTGCATGCCC GCGACTGGGA TTTAGGCGAG GCAGAACTGC CTTGGATGGG GGAAGACGCT
TGGGTGGAGG CTGCTCAAGC CCTACTGGAA GGTCAGCCTT CAGTGCTAGA ACCGCTGCTG
CGCTGGAATG GCGCGGCCTA TCTCTGGTTT TTGGGCATGG CCTCATCAAT GACGGCAGGG
TTGGTTCAGG TGGATCACCT GTTACAAACC AAGGCGCTGC TCCAACAACG CGATCGCTTG
CAACAGATTC TTCAACTCGT ACCCGATTTC TCACTCTCTT GA
 
Protein sequence
MSERFRDLIR KVGSGRHTSQ VLTQAEAAEA LQLMLSATAT PAQIGAFLIA HRIRRPTGTE 
LAGFLETYAD WLPAVLAPST TRPPLVLGYP YDGRDRTAPL GPLLALLLAA VGQPVVLHGS
DRVATKYGVP LVELWDAIGV NWRSRSVADL NRCLEQAGVA QLHQPSLCPA AEVLNGYRSE
LGKRPPLATA ELMLVPVQGA ALPVCGFVHP PTELMIEEAL SLRGITTFFT IKGLEGSPEL
PRDRAAIVGR WQNGHCDRLI LHARDWDLGE AELPWMGEDA WVEAAQALLE GQPSVLEPLL
RWNGAAYLWF LGMASSMTAG LVQVDHLLQT KALLQQRDRL QQILQLVPDF SLS