Gene P9211_07971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_07971 
SymbolaroB 
ID5730097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp702009 
End bp703121 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content39% 
IMG OID641285161 
Product3-dehydroquinate synthase 
Protein accessionYP_001550682 
Protein GI159903338 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.277449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00421114 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAACCAAA ACTCAATCCG AATAAAAATA AAATTAGCTC ACAACCCATA TGAAGTTGTA 
ATTAAGAAAA ATGGTTTAGC GCGAATAGGC GAAGAGTTAA AAAAAATAGG CTTCAAGAAA
GCAACAAAAG TTCTTGTAGT TACCAATAAA GATGTTTCAG TCCACTATGG AAAAGAGTTT
ATTCACAACC TAAGTGACAA TGGCTTCAAC CCAACCTTAA TTGAGATAAA AGCAGGGGAA
GAAAGAAAGA ATCTCGCAAC CATATCTGAT ATTCACAATG CTGCTTACAC ATCAAGACTT
GAAAGGGGTT CATTAATGAT TGCCCTTGGG GGTGGAGTTA TTGGCGATAT GACAGGCTTT
GCAGCTGCTA CCTGGCTAAG AGGAGTTTCT TTTGTACAAG TTCCTACAAC TTTACTAGCC
ATGGTTGATG CCTCTGTTGG AGGCAAAACG GGAGTCAACC ATCCCAAAGG GAAGAACCTA
ATAGGTGCGT TTCATCAGCC AAAACTAGTT CTGATTGATC CAATAACATT AAAAACCCTG
CCCGAACGTG AATTCAAAGC TGGGATGGCA GAAGTCATCA AATATGGTGT TATTAGTGAC
AAAAAGTTAT TCCGGAAACT GGAAGATGCA CCAAGACTTG ACAAGCTAGA AACACTTACT
GACAGATTTT TATTGGAGAT AATCCAAAGG TCCGTTCAAA CTAAAGCACA TATCGTAGAA
CTAGATGAGC GAGAGGGTGG CATACGAGCT GTACTTAATT ATGGTCATAC ATTTGGACAT
GCGATTGAAG CTTTATGTGG CTATGGTACA TGGCTTCACG GTGAAGCTGT TTCTATGGGC
ATGATCGCCA TAGGTCAACT AGCTTTAGAG CGAAACATAT GGAATATTAG CGACCTAGAA
AGACAACGTA AGGTTCTGTG TCAAGCAGGG TTGCCTACAA TTTGGCCAAG GGTTTGTGCT
GAAGATGTTA TAGAAATACT TAAAAGTGAT AAAAAAGTTA AAGATGGTGA GATCAACTTT
ATCGTTCCAA CTGAAATTGG GAAAGTAGAA ATTATTAAAA ATTTTACCGT CAATGAAATC
AAACAAGCAC TTCAGAAGTT AGCATCCAAA TAA
 
Protein sequence
MNQNSIRIKI KLAHNPYEVV IKKNGLARIG EELKKIGFKK ATKVLVVTNK DVSVHYGKEF 
IHNLSDNGFN PTLIEIKAGE ERKNLATISD IHNAAYTSRL ERGSLMIALG GGVIGDMTGF
AAATWLRGVS FVQVPTTLLA MVDASVGGKT GVNHPKGKNL IGAFHQPKLV LIDPITLKTL
PEREFKAGMA EVIKYGVISD KKLFRKLEDA PRLDKLETLT DRFLLEIIQR SVQTKAHIVE
LDEREGGIRA VLNYGHTFGH AIEALCGYGT WLHGEAVSMG MIAIGQLALE RNIWNISDLE
RQRKVLCQAG LPTIWPRVCA EDVIEILKSD KKVKDGEINF IVPTEIGKVE IIKNFTVNEI
KQALQKLASK