Gene Synpcc7942_2123 details

Gene Information

Locus tag: Synpcc7942_2123
Symbol: trpD
ID: 3774342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 2203801
End bp: 2204847
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 62%
IMG OID: 637800568
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_401140
Protein GI: 81300932
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
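
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function name `check_lengths` is made up for illustration.

```python
def check_lengths(start_bp, end_bp):
    """Derive gene and protein length from replicon coordinates.

    Assumes 1-based inclusive coordinates and a CDS that ends
    with a single stop codon (not translated into an amino acid).
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    protein_length_aa = codons - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Values taken from the record above.
gene_len, protein_len = check_lengths(2203801, 2204847)
print(gene_len, protein_len)  # 1047 348
```

Under these assumptions the result agrees with the listed Gene Length (1047 bp) and Protein Length (348 aa).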


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.917904
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGTTG CCCCGCCTGC CTTCGCCGAA GCTCAAGTCC TGCTGCAGCG TCTGTTAAAT 
CACGAGTCGC TAGGGGCAGT GCAGGCCCGA GCCTTGATGG AGCAGTGGTT GTCGGGCACC
CTGCCCGAGG CTCTGTCTGG GGCACTTTTG GCTGCTTTGC AGAGTAAGGG CGTGTCTGCT
CAAGAACTGG CGGCCATGGC GCAGGTGCTC CAAGAACAGG CCGTTGCAGT TGAAGCGAGC
GATCGCCGGG AGCCCCTTGT CGATACCTGT GGGACCGGTG GTGACGGTGC CGAAACTTTC
AACATCTCAA CGGCCGTGGC TTTTGTGACG GCAGCGGCTG GCGTCAAAGT TGCCAAGCAT
GGCAATCGTT CTGCCTCTGG TCGGGTGGGG TCAGCGGATG TCCTCGAAGC TCTGGGGCTT
AACCTGACAG CACCGAGCGA TCGCATCCAT GCGGCAGTGG ATGAGGTCGG CATTACCTTC
CTGTTTGCCC CAGGCTGGCA TCCCGCCATG AAAGCCGTCG CTCCGCTCCG CAAAATCCTC
GGAGTACGCA CCGTCTTTAA TCTGCTGGGC CCCTTGGTCA ACCCGCTCCG CCCGACGGGG
CAAGTCATTG GGGTCTACAA TCCCGGGCTG CTGCCCACGA TCTCAGGAGC CTTGGCGGAA
CTCGGAGTTC GTCGGGCGAT CGTGTTGCAT GGTCGCGAAG GTCTGGACGA AGGGGGGCTA
GCCGACTGCA CGGATCTGGC GATCGTGCGC GAAGGGCAGC TCAGCCAGCA GGTTGTCGAT
CCCCGCGATC TGGGTTTGAC CCAGGCCCCG ACGGTCGCAC TCAAGGGCGG CAGTGTTGAA
GAGAATGCCG ATATTCTCAA AGCGGTTTTG CAAGGGAAGG GGACGCGGGC TCAGCAGGAT
GCGGTCCTGC TTAATGCCGC CTTAGCCTTG GAAGTTGGGG AGCAGGTCGA TCGCCTTGAC
CAAGGCATCA GTTTGGCGCG ATCGGTCTTG GCCAGTGGCG CGGCGTGGCA AAAGCTCACC
CAGTTAGCGG CCTTCCTTCA GAGCTAA
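
The GC content field in the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of 10 bases across several lines; the helper name `gc_content` is hypothetical and not part of any database tool.

```python
def gc_content(sequence):
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (block separators, line breaks) is ignored, so the
    blocked layout used on this page can be passed in directly.
    """
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Small illustrative input, not the full gene sequence.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence to this function and rounding to a whole percentage should give a value comparable to the GC content field, provided that field was computed over the same span.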
 
Protein sequence
MLVAPPAFAE AQVLLQRLLN HESLGAVQAR ALMEQWLSGT LPEALSGALL AALQSKGVSA 
QELAAMAQVL QEQAVAVEAS DRREPLVDTC GTGGDGAETF NISTAVAFVT AAAGVKVAKH
GNRSASGRVG SADVLEALGL NLTAPSDRIH AAVDEVGITF LFAPGWHPAM KAVAPLRKIL
GVRTVFNLLG PLVNPLRPTG QVIGVYNPGL LPTISGALAE LGVRRAIVLH GREGLDEGGL
ADCTDLAIVR EGQLSQQVVD PRDLGLTQAP TVALKGGSVE ENADILKAVL QGKGTRAQQD
AVLLNAALAL EVGEQVDRLD QGISLARSVL ASGAAWQKLT QLAAFLQS
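
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is an illustration only: it builds the codon table by hand in the conventional T, C, A, G order rather than using a library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons). Alternative start codons are not handled, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTCGTTG CCCCGCCTGC CTTCGCCGAA"))  # MLVAPPAFAE
```

The output matches the first block of the protein sequence listed above; the final codons of the gene, CAG AGC TAA, translate to QS followed by a stop, matching the end of the protein.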