Gene Moth_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2043 
Symbol 
ID3831189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2133173 
End bp2134438 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content66% 
IMG OID637829972 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_430882 
Protein GI83590873 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.453314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCC TGGTAGTTGG CGGCGGCGGC CGGGAACACG CCCTGGTCTG GAAGATTGCC 
CGGAGTCCCA GGGTTACGGA AATCTACTGC GCCCCGGGCA ACGCCGGCAT CGCCGGCCTG
GCCTACTGTG TGCCCATTGC TGCCGATGAT ATTACCGGCC TGCTGGACTT CGCCCGTCGG
GCTGCCATAG ATCTCACCGT CGTCGGTCCC GAGGCCCCCC TGGTGGCCGG TATCGTCGAC
GCCTTCCAGG CGGCCGGCCT GCCAATCTTT GGCCCCTCCC GGGGAGCGGC CCGGCTGGAA
GGAAGCAAGA TTTTCGCCAA AGAGTTGATG CAGGAAGCAG GCATCCCCAC GGCTGCCCAC
GCTACCTTTA CGGAGCCCGG TCCGGCCCTG GCCTACCTGG AGGAACACCC CGGACCGGTG
GTGGTCAAGG CCGACGGCCT GGCAGCCGGC AAGGGCGTGG TGGTGGCCCC GGACGCGGCT
ACGGCCCGAG CAGCGGTCCG GGATATGTTT GGAGGCAAAT TTGGTCCCGC CGGCCGGAGG
GTAATCCTCG AAGAACGCCT GGAAGGGGAA GAAGTCAGCA TCCTGGCCCT GTGCGATGGG
AAAACAGCTA TTCCCCTCCT CCCTTCCCAG GACCATAAGC GGGTGGGGGA AGGGGATACC
GGCCCCAATA CCGGCGGCAT GGGCGCCTAC GCGCCGGTGC CCTTCTATAC CCCGGAGATT
GCCGCCCAGG TGGAAGAAAG GGTTCTCCGG CCGGTTATCC GGACCATGGC CGCAGCCGGG
CACCCCTACC GGGGCGTCCT TTACGCCGGG CTGATGCTTA CCCGGGAAGG TCCCAAAGTC
CTGGAGTTCA ACTGTCGCTT CGGCGACCCG GAAACCCAGC CCCTGATGCT CCTTTTGGCA
AGCGACCTGG TGGAGCTGAT GCTAGCTGCC GTCAACGGTG AGCTGGCGGG AACGAGGATC
GCCTGGTACC CCGGTGCCGC CGCCGGGGTG GTCCTGGCGG CGGGCGGTTA TCCGGGCCCG
TACGCCAAGG GTGATTTTAT TACGGGATTG GAGGCAGTGG CACCGGGGGT GGAAGTCTTC
CACGCCGGCA CCGCCCTGGT GGACGGCCAG GTCGTAACGT CCGGGGGCCG GGTGGTGTGT
GTAACCGCCC GGGGTAAAGA TTTACGGGCC GCTTTGGACC GCGTTTATGA CAGTATCAGG
GCCATCCACT TCCAGGGAAT GCACTACCGC CGGGATATCG GCCGTCGGGC CTTGGAGGCA
ACTTAA
 
Protein sequence
MRILVVGGGG REHALVWKIA RSPRVTEIYC APGNAGIAGL AYCVPIAADD ITGLLDFARR 
AAIDLTVVGP EAPLVAGIVD AFQAAGLPIF GPSRGAARLE GSKIFAKELM QEAGIPTAAH
ATFTEPGPAL AYLEEHPGPV VVKADGLAAG KGVVVAPDAA TARAAVRDMF GGKFGPAGRR
VILEERLEGE EVSILALCDG KTAIPLLPSQ DHKRVGEGDT GPNTGGMGAY APVPFYTPEI
AAQVEERVLR PVIRTMAAAG HPYRGVLYAG LMLTREGPKV LEFNCRFGDP ETQPLMLLLA
SDLVELMLAA VNGELAGTRI AWYPGAAAGV VLAAGGYPGP YAKGDFITGL EAVAPGVEVF
HAGTALVDGQ VVTSGGRVVC VTARGKDLRA ALDRVYDSIR AIHFQGMHYR RDIGRRALEA
T