Gene Amir_4100 details

Gene Information

Locus tag: Amir_4100
Symbol:
ID: 8328293
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 4817745
End bp: 4819292
Gene length: 1548 bp
Protein length: 515 aa
Translation table: 11
GC content: 73%
IMG OID: 644944565
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_003101802
Protein GI: 256378142
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
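
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4817745, 4819292)
print(length, protein_length_aa(length))  # prints "1548 515"
```

Under these assumptions, both values agree with the gene length and protein length fields of the record.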


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0908626
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGCCC ACTACAACGA CATCATCGTC GGAGCGGGAT CGGCCGGTTC GGTGCTCGCG 
GCCAGGCTCA GCGAGGACCC GGCCAGGCGG GTGCTGCTGC TGGAGGGCGG ACCGGACCAC
GCCCGGCCGG ACGAGCTGCC CGAGGAGGTG TTCCACGGCA GGTCCATGTC CTTCACCGGG
CAGGACTGGG GCTTCCGCGC CGACGTGCAC GACGGCCGCC GCATCCGCTA CCCGCGCGGC
AAGCTCACCG GCGGCTCCTC CGCGGTCGGC GCGACGGTCG GGCTGCGCGG CGTGCCCGCC
GACTACGACG ACTGGGCGGC GGCGGGCAAC CCCGCCTGGT CCTACGAGGG GGTGCTGCCC
TACTTCCGCA GGCTGGAGAG CGACGGGGAC TACGGCGAGA GCGAGTTCCA CGGCGGCTCC
GGCCCGATCC CGATCCGGCG CTGGAGCGCG GACGAGCTGC CGATCGGGCA GACCGCGTTC
ACCCAGTCCT GCCTGGAGCA CGGGTTCGCC GAGGTCGCCG ACCACAACCA CCCGGAGGCC
ACCGGCGTCG GCTCCATCCC GTCCACCCGG CACGACCGGG ACCGCAGGGC CACCACCGCC
AGCACCTACC TGGCGCTGAC CAGGGGCAGG GCCAACCTGG AGCTGGCGCC CGGCCTGCTG
GTCGACCGGG TGGTGTTCGA CGGGCAGCGG GCGGTCGGCG TGCTGGTCGG CGCGCCCGGC
GCGGAACCGG AGCTGGTGCG CGGCGACCGG GTGCTGCTGG CGGCGGGCGC GATCGGCACG
CCCGCGATCC TGCTGCGCTC GGGCGTCGGA CCCGCCGAGG ACCTGCGGCG GCTGGGCGTG
GACGTGCGCG CCGACCTGCC GGGCGTGGGC GCGAACCTGG TCGACCACCA GCGCACCGGC
GCGTTCCTGG TGCCCGAACC GGGCTCGGTG GACCGCACCG AGGCGTTCCT GCAGCAGATC
CTGCGCACCA CCTCACCGGT CACCGGCGAC TTCAACGACC TCCAGTACTA CATGGTCAAC
CACTTCGGCC TCGGCCCGTT CCCGGAGCTG CAGATGCTCG CGGGCACCAC CGAGATCCTG
GGCGTGATGG TCGTGGCGCA GCGCCCCGGC TCGCGCGGCC GGGTCGCCGT CGACTCCACC
GACCCGCGCG CGGCCCCGGT GATCCGGTTG AACTTCCTGG ACGACGAGCG CGAGCTGGAC
GTGCTCGTGG ACGGGGTGCG CACCGCGTGG CGGCTGGCGC ACCACCCGGA CGTCCTCAAG
CTGGGGCAGG GCTTCGTGGT GCTGCGCGAC GCGATGATCG ACAACGACGA CATGGTGCGG
CAGTACGTCA AGACCAGCCT GGAGAGCGCT TACCACCCGA CCGGCACGGT CCGCATGGGA
CCGGCGTCCG ACCCGTCGAC CGCGGTGGAC GAGCGGGGCG CGGTGCACGG GCTGGAGGCG
CTGCACGTGT GCGACGCCTC GATCATGCCG AACACCGTGC GCGCCAACAC CAACCTCACC
TCCATCATGA TCGGCGAGCG GATGGCCGAC TGGCTGCGCG CCGGCTGA
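
The gene sequence is laid out in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how one might do that, compute a GC fraction, and run a basic sanity check on the reading frame. All helper names are hypothetical. The start-codon check is a simplification: translation table 11 also permits alternative start codons, which this sketch does not handle.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, recognised stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First block of the gene sequence, used here only as a short illustration.
first_block = clean_sequence("ATGACGGCCC")
print(gc_fraction(first_block))  # prints 0.7
```

Passing the full cleaned sequence to gc_fraction gives a value that can be compared with the GC content field of the record.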
 
Protein sequence
MTAHYNDIIV GAGSAGSVLA ARLSEDPARR VLLLEGGPDH ARPDELPEEV FHGRSMSFTG 
QDWGFRADVH DGRRIRYPRG KLTGGSSAVG ATVGLRGVPA DYDDWAAAGN PAWSYEGVLP
YFRRLESDGD YGESEFHGGS GPIPIRRWSA DELPIGQTAF TQSCLEHGFA EVADHNHPEA
TGVGSIPSTR HDRDRRATTA STYLALTRGR ANLELAPGLL VDRVVFDGQR AVGVLVGAPG
AEPELVRGDR VLLAAGAIGT PAILLRSGVG PAEDLRRLGV DVRADLPGVG ANLVDHQRTG
AFLVPEPGSV DRTEAFLQQI LRTTSPVTGD FNDLQYYMVN HFGLGPFPEL QMLAGTTEIL
GVMVVAQRPG SRGRVAVDST DPRAAPVIRL NFLDDERELD VLVDGVRTAW RLAHHPDVLK
LGQGFVVLRD AMIDNDDMVR QYVKTSLESA YHPTGTVRMG PASDPSTAVD ERGAVHGLEA
LHVCDASIMP NTVRANTNLT SIMIGERMAD WLRAG
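
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard base ordering (T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as starts. The function and constant names are illustrative, and the sketch assumes an unambiguous A/C/G/T sequence on the coding strand.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGACGGCCC ACTACAACGA CATCATCGTC"))  # prints MTAHYNDIIV
```

The output matches the first block of the protein sequence; translating the last 18 bases of the gene in the same way yields the final residues WLRAG.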