Gene Nmul_A1939 details


Gene Information

Locus tag: Nmul_A1939
Symbol:
ID: 3784235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2229927
End bp: 2231009
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 58%
IMG OID: 637812025
Product: galactose-1-phosphate uridylyltransferase
Protein accession: YP_412626
Protein GI: 82703060
COG category: [C] Energy production and conversion
COG ID: [COG1085] Galactose-1-phosphate uridylyltransferase
TIGRFAM ID: [TIGR00209] galactose-1-phosphate uridylyltransferase, family 1
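
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (the record states neither, and the variable names are illustrative only):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 2229927
end_bp = 2231009

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codon_count = gene_length // 3        # whole codons in the CDS
protein_length = codon_count - 1      # assumes the final codon is a stop

print(gene_length, "bp")    # 1083 bp
print(protein_length, "aa")  # 360 aa
```

Under those assumptions the result matches the Gene Length and Protein Length fields.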


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.509299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTATTCGC TTGAGCTCAC CAAGCCGGAT GGCCGTCGGC TGATTCTGTA TTCCCGGGAG 
GCTATTAATC CTGATATCGT TGCGCCCAGC CCTTTCACGG AGCCGCTGAA CGCGTATCCT
CACCTCAGAT GGCACCCGTT ACGCGGAGAG TGGGTGACGT ATGCAGCGTA TCGGCAGGGG
CGGACGTTCC TGCCACCCCC TGAATATAAC CCGCTCGGCA TAACAACTGA TCCGTCTCAC
CCTACCGAAG TGCCTGCCGG AAACTGGGAT GTGGCTGTCT TCGACAACCG TTTCCCGTCC
CTGGGGCCAA TAGGAGACCC TCCCTCGCTT ATCGTGCCAA GCGCGCCGGC TGTAGGGCAT
TGTGAGGTAG TAGTTTTTAC GAAAGACGCC AAAGCTTCGC TGGGTGCCTT GTCGCTCGCT
CACATCGAAT TATTGATGGA AGTATGGGCA GAGCGTACCG CTGTGATGGC GCGGCGAGAG
GATATTGCAT ATGTACTGCC ATTCGAGAAC CGGGGTGCGG AGGTTGGTGT AACGCTGCAT
CATCCCCATG GTCAAATCTA CGCCTACCCC CTGGTGCCGC CTGTCCCCAG CCGCATGCAG
CAGGAGGCAC AACACTACTA TGAAGCGAGG GGAAGTGGGG TGCTGCAGGA CTTCATTATC
GCGGAGCGGG AGGCGGGTGT ACGCATGCTT TATGAGGGTG AACACGCTGT TGCCTTCGTC
CCGGTATGCG CGCGTTATCC GTACGAAGTC TGGCTTGCCC CGACGGAACC GGTTGAAAGC
TTTGTCCACC TCACGCCAGA TCAACGGTTA GACCTGGCGC GGGCATTGAA AACGGTTCTG
CTCAAGTATG ATGGCCTCTG GCAGAGGCCT TTTCCCTACC TGATGGCGTG GTACCAGGCC
CCGGTTGATG GAAAAGCCCA TCCCGAAGCG CACCTGCACG CAGAATTCTA CCCCCCCTAT
CGCACCCGTG AGCGGCTCAA ATATCTTGCA GGAACTGAAA TTGCAGCAGG CTTCTTTGCG
ATGGACGCGT TGCCTGAGGA AAAAGCGCGC GAGCTTCAGC AGGTAGAGGT TACTGTCGAA
TGA
 
Protein sequence
MYSLELTKPD GRRLILYSRE AINPDIVAPS PFTEPLNAYP HLRWHPLRGE WVTYAAYRQG 
RTFLPPPEYN PLGITTDPSH PTEVPAGNWD VAVFDNRFPS LGPIGDPPSL IVPSAPAVGH
CEVVVFTKDA KASLGALSLA HIELLMEVWA ERTAVMARRE DIAYVLPFEN RGAEVGVTLH
HPHGQIYAYP LVPPVPSRMQ QEAQHYYEAR GSGVLQDFII AEREAGVRML YEGEHAVAFV
PVCARYPYEV WLAPTEPVES FVHLTPDQRL DLARALKTVL LKYDGLWQRP FPYLMAWYQA
PVDGKAHPEA HLHAEFYPPY RTRERLKYLA GTEIAAGFFA MDALPEEKAR ELQQVEVTVE
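
The gene sequence begins with GTG, while the protein sequence begins with M. This is what translation table 11 predicts, because GTG is an alternative start codon there and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. The function name `translate_cds`, the constants, and the start-codon set (taken from the NCBI definition of table 11 as best recalled) are assumptions for illustration, not part of the record.

```python
# Amino acids for all 64 codons in NCBI base order T, C, A, G.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumed start codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon reads as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence as listed in the record.
print(translate_cds("GTGTATTCGC TTGAGCTCAC CAAGCCGGAT"))  # MYSLELTKPD
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the full protein, if the start-codon assumption holds.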