Gene Nmul_A1702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1702 
Symbol 
ID3784801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1941849 
End bp1943831 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content54% 
IMG OID637811789 
ProductPhage integrase 
Protein accessionYP_412392 
Protein GI82702826 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACA GAGTCAACTT GCATCTCGGG GGCAATTCAC ACCAAATGGG ACGGATTGCC 
GATAAGAATG AATTCCCTTG GTATCAATCT CGCAGGTTCA ACATTTTTGG ACTTGTATTT
CTGACAAGCG CCGCTATCAG TCTCGCTTAC GTTTACAGCC AGCCGGCAAT TTATCGCAGC
AGCGCGACAT TGCTTACTTC GGCCATGACG CCCATCGACC AGCAGAGCGA TGATGCCGAT
ATTCAACACG TGGCAATCCA GCGCCAAATC CTGCTGGGGC GGGAGCTTGC CGCTGAAACC
CTGAGGAAAT TGAAGGCTTC TCCCGCCGGT AAATCCTTAA ACCGGTTGAC TGAATCCGAT
ATTCAGACTT TGTTGACCGT CGAGCCTGTT CCAGAAACCA ATCTCGTCGA GGTTGCGGCA
GAAGGCTCGG ATCCCCGCTT TCTGCCTTTG CTCATCAATA CATGGATAGA CGTATACCTG
GAAGCGCGAA GTGAAGAAGT AAAACGTTTG AAGAGCAATA CCGAGCGTAT TGTCGAGGGT
GAACTGGAGG AGTTGGCTGA CAAGGTTGCT GCCGCGAGGA CGGCGCTGGA GAACTTCCGT
GAAAACAATA ACATTCTATC GACAGAGCGC GAGGAGAATG AACCATCCGC CCGGCTGAAG
GGGCTGAACG AGTCGCTTAA CAAGGCTTCC GAGGCGGAGG TCAAGGCGAA AGCGGAACTG
GATGCCATCA GAGCTGCAAT TGCCCGGGGA AAGGCAGTGG TTCCCGACGA TGAGAAAATC
AGCTTGGCCG ACCTCGAGAA GCGTCTGCAT GATTCACGTG AAAAACTGGC AGAATTCGAT
AAAAAATTTA CCCGCGAGTA CTTAAACCTG CAGCCCGACC TGAAGTCTCT TCCTGAGCAA
ATCAAGGAAC TCGAGACTGC TATCAACAAT AAGCGGTTGC GGGGTCAAAA TGTAGCTTTG
AATGAAGCCG AAGTGAATTA CGCGGCCGCA CGACAGACCG TCAGGGATAT TCGCGAGCAA
TTGGGCGAGC ACAGGCAGCG GGCATCGGCC TTCGCGACAA AATTTACCCA GCACGAGGCG
TTGAAGACCG ATCTGGAAGG ACTGGAAAAG CTGTATCGGG ATACGCAGGA ACGGCTGGTA
CAGGTGAAAA CCGGCCGCAA GGACAAATAT CCACAGGTCG ATGTGATTAG CAGGGCATAT
GAATCAAGGG AGCCAGTAAG ACCGAACTAC AGTCGTGAAG CCCTGATAGC GCTCGGCGGC
TCCCTGCTTC TGGGATTGAT CAGCGTGTGG GTTTTTGAGT ATCTGACCCA AAAGAAGGAA
CAATCGCCTT CCATCGCAGT ATTCGGTGTG GGGAGATATA CCCCGCCTGC TACCGAGCTT
ATTGATCATC CTCACGCCGC GCTCGGGACC AAAGCCGAAC CGTTAGAGCA GAAAACCAAC
TTTGCCTTGC CAAAACCGGT GCACCGGGAA CTTTCGAGCC ATCAGTTACG GACCCTGATC
AATGCCGCCA ACCTGAAGGG GAAGCAGCTT ATCGGTTTGC TGCTCAGCGG CCTCGACATC
AATGAAGCGG CGTCCTTGAA GGCGAATCAG ATCAACGAGC GGGCCAATGT CATCAACCTC
GAAGGAAGAA CTCCGAGGGC GGTGCCCCTC AGCAATCCGC TCAAATCCTT GCTCCAGCAT
TCCGGTAACC GCCCGGTATG GGATCCGGAT GATTCGCAGA CACGCGTCGA TCTGGCAACC
GCCCTGGTTT GCGCCGCAAT CGATTCGGGA TTGCCTGACC CCCAGGAAAT CACCGCCGAG
ACGATCCGGC ATAGCTACAT TACTTATCTT GTACGCCAGG GCTTGCGGCT TTCCGAGCTG
GAGCAGATCG TCGGTCATCT GGACCCTTCG GTGATTTCGA GTTATGGCAC CTATTCGCCG
CCCCAGCAAG GCCGTCCTCT GCACGAGATC GAAGTGCTGC ATCCAGCGTT GCTCATAATC
TGA
 
Protein sequence
MKNRVNLHLG GNSHQMGRIA DKNEFPWYQS RRFNIFGLVF LTSAAISLAY VYSQPAIYRS 
SATLLTSAMT PIDQQSDDAD IQHVAIQRQI LLGRELAAET LRKLKASPAG KSLNRLTESD
IQTLLTVEPV PETNLVEVAA EGSDPRFLPL LINTWIDVYL EARSEEVKRL KSNTERIVEG
ELEELADKVA AARTALENFR ENNNILSTER EENEPSARLK GLNESLNKAS EAEVKAKAEL
DAIRAAIARG KAVVPDDEKI SLADLEKRLH DSREKLAEFD KKFTREYLNL QPDLKSLPEQ
IKELETAINN KRLRGQNVAL NEAEVNYAAA RQTVRDIREQ LGEHRQRASA FATKFTQHEA
LKTDLEGLEK LYRDTQERLV QVKTGRKDKY PQVDVISRAY ESREPVRPNY SREALIALGG
SLLLGLISVW VFEYLTQKKE QSPSIAVFGV GRYTPPATEL IDHPHAALGT KAEPLEQKTN
FALPKPVHRE LSSHQLRTLI NAANLKGKQL IGLLLSGLDI NEAASLKANQ INERANVINL
EGRTPRAVPL SNPLKSLLQH SGNRPVWDPD DSQTRVDLAT ALVCAAIDSG LPDPQEITAE
TIRHSYITYL VRQGLRLSEL EQIVGHLDPS VISSYGTYSP PQQGRPLHEI EVLHPALLII