Gene Tery_5008 details


Gene Information

Locus tag: Tery_5008
Symbol: aroB
ID: 4246663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7651699
End bp: 7652790
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 42%
IMG OID: 638109818
Product: 3-dehydroquinate synthase
Protein accession: YP_724394
Protein GI: 113478333
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
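
The listed gene and protein lengths follow from the start and end coordinates. The sketch below is a minimal check, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(7651699, 7652790)
print(length, protein_length_aa(length))  # 1092 363
```

Both values agree with the record: 1092 bp is 364 codons, of which the last is the stop codon, leaving 363 residues.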


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATCCA TCACTGTTCA ACTTCCCCAA AAATCCTATG AAATAGCGAT CGCTTCAGGC 
CACCTCGACC AACTTGGCAG AAAAATGGAA TCCCTCAACC TGGGAAAAAA GGTCTTGCTG
GTATCCAACC CAGAAATATT TGCTCATTAT GGCGAAAGAG CAATTATCTC ACTCCAAGAA
GCCGGTTTTG ATGTCTCGGA CTGCATTCTC CCCTCAGGAG AAGAATATAA AACTCCTCAA
AACCTTAACT GTATTTATGA TGCAGCTTTA GCACACCGTC TCGAACGCTC TTCCACAATA
GTCGCTCTTG GTGGTGGAGT AGTTGGGGAT ATGACCGGGT TTGCTGCGGC AACTTGGTTA
CGCGGTTTGA ATGTAGTACA AGTTCCTACT TCTCTTTTGG CGATGGTCGA TGCTGCTATT
GGTGGAAAAA CAGGAGTTAA TCATCCTCAA GGCAAAAATC TTATCGGTGC TTTCCATCAA
CCACGGTTAG TCTTAATTGA TCCAGAGGTA CTAAAGACTT TACCCTTGCG AGAATTTCGG
GGAGGGATGG CAGAAGTGAT CAAATATGGA GTTATATGGG ATGCCGAGTT GTTTTTTCAA
ATGGAAAATA GTCAGAGTCT TGATGACATT AACAATTTAA CACCAGGGTT ATTAGAGGAA
ATCTTGATTA AGTCTTGCCA AAGTAAAGCA CATGTGGTAG CAAAAGATGA GAAAGAATCT
GGGTTAAGAG CAATTTTGAA TTACGGTCAT ACCATAGGTC ATGCAGTGGA AAGTTTGACT
GGTTATACCG CGGTGACTCA TGGTGAGGCG GTCAGTATTG GGATGGTGGC AGCAAGTGGG
TTAGCATTAG AGTTAGGAAT GTGGGATGAG CAGAGCGATC GCCGTCAGTT AGTCTTGATA
GAAAAAGCTA GTTTGCCAAC CAAACTTCCG GATGGCTTGG ATATTGATGA TATTTTGGTT
TCTTTACAGA CAGATAAAAA GGTAAAAGCA GGTAAGGTAC GATTTGTTTT ACCTACTGGA
ATAGGATCAG TTACAGTGAC AGATAAGGTA AGTCAAGATG TGTTGAGGAG AGTATTGTTG
AGAATCAGTT AA
 
Protein sequence
MASITVQLPQ KSYEIAIASG HLDQLGRKME SLNLGKKVLL VSNPEIFAHY GERAIISLQE 
AGFDVSDCIL PSGEEYKTPQ NLNCIYDAAL AHRLERSSTI VALGGGVVGD MTGFAAATWL
RGLNVVQVPT SLLAMVDAAI GGKTGVNHPQ GKNLIGAFHQ PRLVLIDPEV LKTLPLREFR
GGMAEVIKYG VIWDAELFFQ MENSQSLDDI NNLTPGLLEE ILIKSCQSKA HVVAKDEKES
GLRAILNYGH TIGHAVESLT GYTAVTHGEA VSIGMVAASG LALELGMWDE QSDRRQLVLI
EKASLPTKLP DGLDIDDILV SLQTDKKVKA GKVRFVLPTG IGSVTVTDKV SQDVLRRVLL
RIS
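
The gene and protein sequences above can be cross-checked against each other and against the listed GC content. The following is a minimal sketch, not the database's own method: it strips the 10-base grouping, computes the GC fraction, and translates codons. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard table is used here. All function and variable names are hypothetical.

```python
# Standard codon table built from the conventional TCAG ordering.
# Table 11 uses the same amino acid assignments as this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence dump."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean_sequence(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as listed in this record.
first_line = "ATGGCATCCA TCACTGTTCA ACTTCCCCAA AAATCCTATG AAATAGCGAT CGCTTCAGGC"
print(translate(first_line))  # MASITVQLPQKSYEIAIASG
```

The output matches the first 20 residues of the protein sequence. Pasting the full gene sequence into `gc_fraction` and `translate` gives a way to compare against the listed GC content and the complete protein.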