Gene GM21_2660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2660 
SymbolaroB 
ID8138002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3095978 
End bp3097066 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content63% 
IMG OID644870264 
Product3-dehydroquinate synthase 
Protein accessionYP_003022454 
Protein GI253701265 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.000170461 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGATAGCTG AAAAGATCAG GGTCGCGCTC GACGAACGGA GCTACGACAT CGAAATGGGC 
GCCGGCAATC TCGACAGAAT CGGTTCCCTT TGCCGCGAAG TCGGTCTCTC CGGAACGGCG
GCGGTGGTCA GCAACACCAC CGTGGCCCCT CTCTACTACG AAACGGTCCG CCTCTCCATG
GAGCGTGCAG GTTATCGGGT GGTGCCGGTA ACTCTCCCGG ACGGCGAGGG GTACAAGAAC
AGCGCCACGC TCAACCTGAT CTACGACGGC CTGGTCGACG CCTCGCTGGA CCGCGGCTCC
TTCATCCTGG CCTTGGGCGG AGGGGTGATC GGCGACATGG CCGGGTTCGC CGCCGCTAGT
TACCTGCGCG GCATTCCCTT CGTCCAGATC CCCACTACGC TCCTCTCCCA GGTCGACTCC
AGTGTCGGCG GCAAGACCGG CATCAACCAT CCCCGCGGCA AGAACCTGAT CGGCTCCTTC
TACCAGCCGA AAGCGGTACT CATCGACGTC GCCACACTCG ATACCCTCCC GGAAAGAGAG
TTCCTGAGCG GCCTGGGAGA GATCGTCAAG TACGGTGCGG TGCTGGACGG CGGCTTTTTC
GACTTCCTGG AACAAAACGC GAAACTGCTA TTGGCCCGCG ACAAGGAGGC CCTGATCCAG
GCGGTCAGCC GCAGCTGCGC CATCAAGGCG AAGGTCGTGG CGGAGGACGA ACGGGAAGGG
GGGGTGCGCG CTGTGCTGAA CTTCGGGCAC ACCCTGGGGC ACGCCGTGGA GACCCTTACC
GGTTACACCC GCTACCTGCA CGGTGAGGCG GTAGCTATCG GCATGGTACA GGCGGCGCGG
ATCTCCCAGC ACTACGGGTT CTGCTCACAG GCGGACCGGG AGCGCATCGA GGCTCTCATC
GTGGCGCTTG GGCTGCCGAT AGAGCTTCCT ATCTTCCCCG CCCAGCAGTA CAGGGAGGCG
CTCTCGCACG ACAAGAAGGT ACGCGACAAG GGGCTCTTGT TCATCTGCAA CCAGGGGATA
GGCGCCTACC GCATGGAAAG GCTCACAGAC CTTGGGGCGC TTCTGGAGAT CTGTGGCATA
GGAGAATGA
 
Protein sequence
MIAEKIRVAL DERSYDIEMG AGNLDRIGSL CREVGLSGTA AVVSNTTVAP LYYETVRLSM 
ERAGYRVVPV TLPDGEGYKN SATLNLIYDG LVDASLDRGS FILALGGGVI GDMAGFAAAS
YLRGIPFVQI PTTLLSQVDS SVGGKTGINH PRGKNLIGSF YQPKAVLIDV ATLDTLPERE
FLSGLGEIVK YGAVLDGGFF DFLEQNAKLL LARDKEALIQ AVSRSCAIKA KVVAEDEREG
GVRAVLNFGH TLGHAVETLT GYTRYLHGEA VAIGMVQAAR ISQHYGFCSQ ADRERIEALI
VALGLPIELP IFPAQQYREA LSHDKKVRDK GLLFICNQGI GAYRMERLTD LGALLEICGI
GE