Gene Namu_2473 details

Gene Information

Locus tag: Namu_2473
Symbol:
ID: 8448084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2729176
End bp: 2730210
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 73%
IMG OID: 645041586
Product: Alcohol dehydrogenase GroES domain protein
Protein accession: YP_003201830
Protein GI: 258652674
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
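
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the reported gene length includes the stop codon. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 2729176
end_bp = 2730210

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under those assumptions the results agree with the Gene Length and Protein Length fields of the record.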


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.000066055
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0147407
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCGGG CGGTGGCCGA GCCGAACGAC GACGTACGGC TGGAAACGGC CGCGATTCCC 
ACCCCGGCGC CGGGCGAGGT CCTCGTCCGC AGCACGCTGG TGGGGATCTG CGGATCCGAC
ACCCATGCCC TGGCCGGCCA CCACCCCTTC CTGACCAGCC GCTACCTGCC CGGCCACGAG
GCAACCGGCA CCGTCGTCGC GCTCGGCGAC GGCATCGAGT CGCTGTTCGT CGGGCAGCGG
GTCCTGCTCA AGCCCAACGT CGCCTGCGGC GACTGCGCGA ACTGCGCCGC CGGCCGGTCC
AATGCCTGTG CCCAGCTGTC CTGGATCGGC TGTGACCCCT CGCTGCATTG GGCCGGCGCG
ATGGCCGACT ACTTCGTCGC GCCGGAGCGG AACCTGTTCC CGGTGCCGGA CGGGGTCGAC
GACCGCACCG CGGTCCTCGT CGAATGCCTG GCCACACCCG TGCATGCGGT GCGCATCAGC
GGCGACCTGA CCGGCGCCCG GGTCGTGATC CTGGGCGCCG GCACCATCGG CGTGCTGTGT
GTCGTCGCCG CCCGGCACGC CGGTGCCGGC GCCATGGTGG TCACCGACCT GGACCCGGGC
AAGTTGGACC GGGCCAGGCG CGTCGGCGCC CACGGCGCGG TGCCGGCCGA CGACCCGGCG
GTGAACGAAC GGGTCCTGGC CCAGTTGGGT GGCCCGGCGG ACGTGGTGCT GGACTGCGTG
ACCAACGAAC GATCGTTGAA CCAGGCCGTG GCCCTGCTCC GGCGGGCCGG CACCCTGGCC
GTGGTCGGGG TGCCGCCGCG GGACGCGACG CTGCCCATGC CGCTGATCCA GGACTGGGAG
ATTCGCGTTC AGGGATGCGC CGCCTACACC GAGGCCGATA TCCGCACGGC CCTGCAGATC
GCCACCGACG CAGGCCTGCC GACCGACGAG ATCGTTGCGG CCACCTACGG TTTGGACGAG
GTGGCGAGCG CCTTCGGGCA GGCCGCGGCC GACAGCTCCG GCAAGGTGCT CATCGCCCCG
CCCCGGCGCG GTTGA
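
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the method the database itself uses; the helper name `gc_percent` is hypothetical, and it assumes the input contains only A, C, G, T characters plus the whitespace used to group the bases in blocks of ten.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (spaces, newlines) is ignored so that the block-formatted
    sequence can be pasted in unchanged.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: the first row of the gene sequence above, pasted as-is.
first_row = "ATGCGTCGGG CGGTGGCCGA GCCGAACGAC GACGTACGGC TGGAAACGGC CGCGATTCCC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of a single row gives the value to compare against the GC content field.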
 
Protein sequence
MRRAVAEPND DVRLETAAIP TPAPGEVLVR STLVGICGSD THALAGHHPF LTSRYLPGHE 
ATGTVVALGD GIESLFVGQR VLLKPNVACG DCANCAAGRS NACAQLSWIG CDPSLHWAGA
MADYFVAPER NLFPVPDGVD DRTAVLVECL ATPVHAVRIS GDLTGARVVI LGAGTIGVLC
VVAARHAGAG AMVVTDLDPG KLDRARRVGA HGAVPADDPA VNERVLAQLG GPADVVLDCV
TNERSLNQAV ALLRRAGTLA VVGVPPRDAT LPMPLIQDWE IRVQGCAAYT EADIRTALQI
ATDAGLPTDE IVAATYGLDE VASAFGQAAA DSSGKVLIAP PRRG
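
The protein sequence follows from the gene sequence by codon-wise translation with translation table 11. The sketch below shows one way to reproduce that without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons); the function name `translate` and the choice to stop at the first stop codon are assumptions made for this example.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons (e.g. GTG, TTG) are
    NOT converted to M here; this gene starts with ATG, so it does not matter.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage: the first 30 bases and the last 15 bases of the gene sequence above.
print(translate("ATGCGTCGGG CGGTGGCCGA GCCGAACGAC"))  # MRRAVAEPND
print(translate("CCCCGGCGCG GTTGA"))                  # PRRG
```

The two fragments match the start and end of the protein sequence listed above; translating the full gene sequence the same way gives the sequence to compare against the whole record.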