Gene Nmul_A1433 details


Gene Information

Locus tag: Nmul_A1433
Symbol:
ID: 3784626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1645228
End bp: 1646802
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 44%
IMG OID: 637811521
Product: hypothetical protein
Protein accession: YP_412128
Protein GI: 82702562
COG category:
COG ID:
TIGRFAM ID:
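
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes one untranslated stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1645228
end_bp = 1646802
reported_gene_length = 1575      # bp
reported_protein_length = 524    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon that is not translated.
codons, leftover = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # coordinates agree with Gene Length
print(leftover == 0)                              # length is a whole number of codons
print(protein_length == reported_protein_length)  # agrees with Protein Length
```

Under these assumptions all three checks hold for this record: 1575 bp is 525 codons, one of which is the stop codon, leaving 524 amino acids.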


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.158089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATCAAC CCCAAGAAAA TAAGCTTCCA GTTTTTCAAA TACTGGTTTT CTCCAGTCTT 
GTTGTGCTCG CACTTTTTCT ATGGCAGGGG CATAAAGGAT TCTCACTATG GGATGAAGGT
TTTCTTTGGT ATGGGGTTCA ACGTGTGATG CTAGGTGAGG TGCCTATCCG CGATTTCATG
GCTTACGACC CCGGCCGTTA CTACTGGTCC GCTACACTCA TGTGGCTGTG GGGAGACAAT
GGCATAGTGG CCTTAAGGGG TAGCTTGGCG GTTTTTCAAG TGATGGGATT ATTCGTCGCT
CTACTGTTAA TTGCTCGAAA TACAAGAACG CTAAATTTTC CTTATTTACT TCTTTCAGCC
ATCACATTGG TGGTCTGGAT GTATCCACGC CACAAATTAT TTGATGTCTC TTTATCTATT
CTGCTGATTG GAGTATTGGC CTTTCTTGTG CAGAACCCTA CAAGGAGACG TTACTTTTTC
ACCGGTTTAT GTGTAGGTTT TGTAGCTGTT TTTGGCCGTA ACCATGGGGT ATACGGTGTC
TTAGGTAGTT TTGGGGTTAT GATATGGCTG ACCATCAGGC AAGCGGATAA GCTTGAATTT
ATCAAGGTGG CTATGCTATG GGCAGTAGGA GTAGCAATTG GTTATATTCC AATACTTCTC
ATGATATTGC TGGTACCAGG CTTTGCTCCT GCCTTCTGGG AAAGCTTGCT CTTTTTCCTT
GAAATTAAAG CAACTAATCT TACTCTACCC GTTCCTTGGC CTTGGCGTTT GGAATTTGAC
TCCGTATCTA TTGGTAAGAC GATTCGTGGA GTGCTGGTTG GCTTGTTTTT CATCGCTATA
GTCGTTTTTG GCGTACTTGC TATCATATGG GTTACTCGCC AGAAATTTCA CAAGCGGGCT
GTTCCATCGG CCTTGGTTGC AACTGCATTC TTGGCATTGC CCTATGCGCA TTATGCTTAT
TCCCGAGCTG ATGTAAGTCA TCTTGCTAAA AGCATTTTTC CTCTATTAGT CGGTTGCCTA
GTGCTGTTGT CCACAAAACC AGCGAGGATC AAATGGCCGT TGGCACTTTT GTTATGTGGG
TCAAGTTTAT TAGTGATGGT GCATTTTCAT CCCGCCTGGC AATGTCGGCC TAGCAAACAA
TGTGTGAGCA TCGTAATTTC AGACACCAAA GTGACTGTTG ATGCTCGCAC AGCGAGTGAG
ATCAGTCTAT TAAAGAAATT AGTTGCTAAG TATGCAGCCG ATGGTCAAAG TTTTATCACA
ACTCCTTTCT GGCCGGGAGC TTATCCCCTG TTCGAAAGAA AGTCTCCTAT GTGGGAGATA
TACGCCTTGT TCTCGCGAAG TGAAAGCTTT CAACAGCTAG AAATTGAACG AATCAAGGTG
ACAAATCCAG GTTTTATCCT GATATTCGAT TTCCCTCTTG ATGGGCGGGA GGAGTTACGT
TTCTGCAATA CACATCCCCT AATTCATAAA TATATCTCGG ATAACTTCGA GATGCTGCAC
GATTCGCCAA ACCTGATCTA TCAAATATAT ACAGTCAAGA AGACAATCTT ATCAGAGCAT
TCTGGATCCC CCTAA
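
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC content value was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a (possibly blocked) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Stop codons of translation table 11 (the table named in this record).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Last display line of the gene sequence, copied from the record.
last_line = "TCTGGATCCC CCTAA"
print(clean_sequence(last_line)[-3:] in STOP_CODONS)  # the CDS ends in a stop codon

# Toy input to show the GC calculation; paste the full gene sequence to
# compare against the GC content field.
print(gc_percent("GGCC AATT"))
```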
 
Protein sequence
MHQPQENKLP VFQILVFSSL VVLALFLWQG HKGFSLWDEG FLWYGVQRVM LGEVPIRDFM 
AYDPGRYYWS ATLMWLWGDN GIVALRGSLA VFQVMGLFVA LLLIARNTRT LNFPYLLLSA
ITLVVWMYPR HKLFDVSLSI LLIGVLAFLV QNPTRRRYFF TGLCVGFVAV FGRNHGVYGV
LGSFGVMIWL TIRQADKLEF IKVAMLWAVG VAIGYIPILL MILLVPGFAP AFWESLLFFL
EIKATNLTLP VPWPWRLEFD SVSIGKTIRG VLVGLFFIAI VVFGVLAIIW VTRQKFHKRA
VPSALVATAF LALPYAHYAY SRADVSHLAK SIFPLLVGCL VLLSTKPARI KWPLALLLCG
SSLLVMVHFH PAWQCRPSKQ CVSIVISDTK VTVDARTASE ISLLKKLVAK YAADGQSFIT
TPFWPGAYPL FERKSPMWEI YALFSRSESF QQLEIERIKV TNPGFILIFD FPLDGREELR
FCNTHPLIHK YISDNFEMLH DSPNLIYQIY TVKKTILSEH SGSP
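
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below builds the codon table from the standard NCBI base ordering (translation table 11 assigns the same amino acids as the standard table and differs only in permitted start codons) and translates the first display line of the gene sequence. The function name `translate` is hypothetical, and the code assumes the input contains only A, C, G and T.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G), third position changing fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence, copied from the record.
first_gene_line = "ATGCATCAAC CCCAAGAAAA TAAGCTTCCA GTTTTTCAAA TACTGGTTTT CTCCAGTCTT"
print(translate(first_gene_line))  # MHQPQENKLPVFQILVFSSL
```

The output matches the first twenty residues of the protein sequence listed above (MHQPQENKLP VFQILVFSSL).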