Gene Mlg_1399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1399 
Symbol 
ID4270621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1603920 
End bp1604990 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content68% 
IMG OID638126155 
ProductnifR3 family TIM-barrel protein 
Protein accessionYP_742238 
Protein GI114320555 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.372686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACAC TTAGCCCTGA CTGGCGGGGA CCGACGCGCT CAGGGTATGC TCGCGCTCGA 
ACAGCGCAAC CTGATTGGAA CATGAACATC GGCCCTTGGA AACTCTCCGG CCGGGTGCTG
CTCGCACCCA TGGCCGGCAT CACCGACCTC CCCTTTCGTC AGCTCTGTCG CCAATGGGGT
GCCGCCCTCG CGGTTTCGGA GATGCTGTCC GCCGATCCAA CCCTGCGCAA GACGCGCAAG
AGCCAATGGC GCGCCACCCT CGCCGACGAC GAGTGCCCAC GGGTGGCGCA GATCGCGGGC
GCGGATCCGG TCGCCCTAGC GGAGGCGGCC CGCTACAACG TACAGCGGGG CGCCCAGGTC
ATCGACATCA ACATGGGCTG TCCGGCAAAG AAGGTCTGCA ATCGCATGGC CGGCTCCGCA
CTGCTCGCGG ACGAGCCCCT GGTGCGCCGG ATCCTGACCG CGGTCGTGTC CGCGGTGGAG
GTCCCGGTCA CCCTGAAGTA TCGCACCGGT CCGTCGCCCG AGCGGCGCAA TGCCGTGGCC
ATCGCCCGGA TGGCGGAGGA CGCCGGGGTG GCCGCACTGA CGCTTCACGG CCGGACACGG
GTCCAGGCCT ACCAGGGTCA GGCCGAATAC CGCAGCGTGG AGGCGGTCTG TCGGGCGGTG
GACATCCCGG TCGTCGCCAA CGGCGATATC GATAGCCCAG ACAAGGCGCG GCAGGTGCTG
GACGAAACCG GCGCCGATGC GGTCATGGTC GGGCGCGCGG CCCAGGGCCG ACCCTGGCTC
TTCCAGGCGA TTCACACCTA TCTGGAGACC GGTACGCGGG TCGCCACGCC TTCGCTGGCG
GTGCGCAAGC AGACCCTGCT GACCCACCTG CGCGAGATTC ATCGCTTTTA CGGCGACTGG
ATGGGGCCCC GCATCGCGCG CAAGCACATC AAGTGGTATC TACAGGCTCT GCAGGTGGAT
CGGTGCCATG TGCAGCCACT GATGCAGCCC ACTGCCCCGG AGGCGCAACG GGTGGCGGTT
GCCGATTGCC TGTCACGGCT CAACGAGGCG CCGGCGGCGG CGCCTGCATA G
 
Protein sequence
MQTLSPDWRG PTRSGYARAR TAQPDWNMNI GPWKLSGRVL LAPMAGITDL PFRQLCRQWG 
AALAVSEMLS ADPTLRKTRK SQWRATLADD ECPRVAQIAG ADPVALAEAA RYNVQRGAQV
IDINMGCPAK KVCNRMAGSA LLADEPLVRR ILTAVVSAVE VPVTLKYRTG PSPERRNAVA
IARMAEDAGV AALTLHGRTR VQAYQGQAEY RSVEAVCRAV DIPVVANGDI DSPDKARQVL
DETGADAVMV GRAAQGRPWL FQAIHTYLET GTRVATPSLA VRKQTLLTHL REIHRFYGDW
MGPRIARKHI KWYLQALQVD RCHVQPLMQP TAPEAQRVAV ADCLSRLNEA PAAAPA