Gene Nmul_A1108 details

Gene Information

Locus tag: Nmul_A1108
Symbol:
ID: 3785688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1276275
End bp: 1277390
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 53%
IMG OID: 637811193
Product: putative membrane-bound dehydrogenase oxidoreductase protein
Protein accession: YP_411803
Protein GI: 82702237
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
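
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1276275
END_BP = 1277390


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues in a complete CDS: number of codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1116 371
```

Both results agree with the Gene Length and Protein Length fields of the record.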


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCATCA CAAAAATTCT CATACGTTTC ATGGCCACTT GCCTGCTCTG TTATGGCTTA 
CCTGCGATGG CAGACAGCAA AACCGGCAAG CTGCCGCTGG ACAAGATCAA GTTACCGGCC
GGATTTTCGA TCGAAGTGTG GGCTGAAGTA CCGAACGCGC GGGGACTGGC ACTGGGAAAA
AACGGCACGG TGTTTGCAGG CTCGATGGAC GAGGGGAATG TCTATGCCAT CAGGGACAAT
GGAGGGGAGC GTGAAGTAAA AACCATCGCC AGAGGGCTTA ATCTTCCCAT AGGGGTCGCC
TTTCGCGATG GCGCGTTATA TGCCTCCTCG GTCGACCGCA TCCTGCGTTT TGACGGTATT
GAGGAAAAAC TCGATCAGCC CGGAAAACCG TACGTTGTCA CCGAACGATT TCCCAACGAG
AAACACCATG GGGGCAGATA TATAGGTTTT GGCCCGGATG GTCTTCTGTA TGTGGCGGTG
GGCGCTCCAT GCAATGTCTG CGAAACGGAG CCGGAGAAGT TTTCGCTGAT TTCCCGCATC
AATCCGGATG GCTCGAACTA TGAGGTCTTC GCATACGGGA TACGAAACTC GGTAGGATTC
GATTGGCATC CCGAAACAAA GGAATTGTGG TTTACCGATA ACGGTCGGGA CTGGATGGGA
GACAACTTGC CGCCGGACGA ACTCAACCGT GCGCCAAAAA AGGGAATGCA TTTCGGCTAT
CCCTACTGCC ATTCCGACGA CATCCTTGAC CCGAAATACG GTGCAAAGCG GGATTGCCGC
AAGCTCGTTT CACCCGCAGC GAAACTTCCC CCGCACGCAG GCGCCCTGGG GATGCGCTTC
TATACAGGGA CCATGTTTCC TCCCCAATAT CGCAACAGCA TCTTTATAGC CGAACATGGC
TCATGGAACC GGCGCAATAA AATCGGCTAC CGCATCGAGT TCGCAAAGAT AAAGAACAAC
AAGGTGATAA AGCAGGAAGT GTTTGCCGAA GGCTGGCTGG AAAATGAAAA AAATTGGGGA
AGACCCGTGG ATGTGCTGGT GATGCCGGAC GGGGCACTGC TCGTGTCGGA TGATTTCGCC
GGGGTGATTT ATCGGATCAG CTACAGGAAA CCTTGA
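
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before any calculation. The following is a minimal sketch of a GC content calculation of the kind reported in the GC content field; `gc_fraction` is a hypothetical helper, and how the source database rounds the value is an assumption.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGCCCATCA CAAAAATTCT CATACGTTTC ATGGCCACTT GCCTGCTCTG TTATGGCTTA"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1116 bp sequence instead of the first line gives the value for the whole gene.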
 
Protein sequence
MPITKILIRF MATCLLCYGL PAMADSKTGK LPLDKIKLPA GFSIEVWAEV PNARGLALGK 
NGTVFAGSMD EGNVYAIRDN GGEREVKTIA RGLNLPIGVA FRDGALYASS VDRILRFDGI
EEKLDQPGKP YVVTERFPNE KHHGGRYIGF GPDGLLYVAV GAPCNVCETE PEKFSLISRI
NPDGSNYEVF AYGIRNSVGF DWHPETKELW FTDNGRDWMG DNLPPDELNR APKKGMHFGY
PYCHSDDILD PKYGAKRDCR KLVSPAAKLP PHAGALGMRF YTGTMFPPQY RNSIFIAEHG
SWNRRNKIGY RIEFAKIKNN KVIKQEVFAE GWLENEKNWG RPVDVLVMPD GALLVSDDFA
GVIYRISYRK P
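
The protein sequence can be rederived from the gene sequence as a consistency check. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the gene sequence: matches the first ten residues of the protein.
print(translate("ATGCCCATCA CAAAAATTCT CATACGTTTC"))  # MPITKILIRF
# Last five codons of the gene sequence, ending in the TGA stop codon.
print(translate("TACAGGAAA CCTTGA"))  # YRKP
```

Translating the complete gene sequence the same way should reproduce the 371-residue protein listed above.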