Gene GM21_3352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3352 
Symbol 
ID8138719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3878910 
End bp3880007 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content64% 
IMG OID644870970 
Productaspartate-semialdehyde dehydrogenase 
Protein accessionYP_003023135 
Protein GI253701946 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01745] aspartate-semialdehyde dehydrogenase, gamma-proteobacterial 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.0858212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTCG GACTGGTCGG TTGGCGTGGC ATGGTAGGCT CCGTTTTGCT CCAGCGCATG 
CAGGAGGAAA ACGATTTCCA GGGAATAGAG CCGGTTTTCT TCACAACCTC GCAGGTGGGG
CAGCCCGCCC CGATGAACGC CGGGACCCTG AAGGACGCCT CGGATATCAA CGAGCTTAAG
AAGCTGGACG TGATCATCAC CTGCCAGGGT GGCGACTACA CCAAGGCGGT GCGCCCGGAG
CTGAACAAGG CGGGATGGAA GGGGTACTGG ATCGATGCTG CCAGCACGCT CCGCATGGAG
AACGACGCCG TCATCATCCT CGACCCGATC AACCGCAACG TCATCGACGC TGCCCTTGCC
AAAGGGGTCA AGGACTACAT CGGCGGCAAC TGCACCGTGA GCCTCATGCT CATGGGCCTG
GGCGGGCTCT TCAAGGCGGG TGCCGTCGAG TGGATCAGCT CCATGACCTA CCAGGCGGCC
TCCGGCGCGG GCGCTCCCAA CATGCGCGAG CTCCTCTCCC AGATGGGCGT ATTGCAGGGC
TCGGTAGCGG ATCTCCTGGC GACCCCGGGC TCCGCCATCC TGGAGATCGA CCGCAAGGTG
ACCCAGACCC TGAGGGGAGG GGATCTCCCG ACCAAGGAGT TCGGTTTCCC GCTGGCGGGG
AGCGTCCTTC CCTGGATCGA CCGCGAGGTC GAGGACGGGC AGAGCCGCGA GGAGTGGAAA
GGGTACGCCG AGACCAACAA GATCCTCGGC ACCGCGAACC CGATCCCGGT CGACGGCATC
TGCGTCCGCG TGGGCGCCAT GCGCTGCCAC AGCCAGGCGC TGACCATCAA GCTGAACAAG
GACATCCCCA TCGGCGAGAT CGAGCAGATG ATCAAGAACG ACAACCAGTG GGTCAAGTTC
GTCCCCAACA CCAAGGCGGA GACCCTGGCT CAGTGCACCC CGGCAGCCGT TTCCGGTTCG
CTCACCGTGC CGGTAGGCCG CGTGAGGAAG ATGAAGATGG GGCCGCAGTA TCTCTCCGCC
TTCACCTGCG GCGATCAGCT CCTTTGGGGC GCCGCAGAGC CGCTGCGCCG CATGCTTCAG
ATACTCAAGG AGCGGTAA
 
Protein sequence
MKVGLVGWRG MVGSVLLQRM QEENDFQGIE PVFFTTSQVG QPAPMNAGTL KDASDINELK 
KLDVIITCQG GDYTKAVRPE LNKAGWKGYW IDAASTLRME NDAVIILDPI NRNVIDAALA
KGVKDYIGGN CTVSLMLMGL GGLFKAGAVE WISSMTYQAA SGAGAPNMRE LLSQMGVLQG
SVADLLATPG SAILEIDRKV TQTLRGGDLP TKEFGFPLAG SVLPWIDREV EDGQSREEWK
GYAETNKILG TANPIPVDGI CVRVGAMRCH SQALTIKLNK DIPIGEIEQM IKNDNQWVKF
VPNTKAETLA QCTPAAVSGS LTVPVGRVRK MKMGPQYLSA FTCGDQLLWG AAEPLRRMLQ
ILKER