Gene Nmul_A1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1444 
Symbol 
ID3784637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1654339 
End bp1656375 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content53% 
IMG OID637811532 
Producthypothetical protein 
Protein accessionYP_412139 
Protein GI82702573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGCAT TCAGGATTTC CGGATTCTCC GGGCTTGTTC CGCGGCTGGC AAAGCACTTG 
CTCAGCTCGA ACCAGGCGCA GACGGCGACC AATTGCAACC TTGCCGCAGG CGACTTGCGG
CCCCGAAACG CGCCGCTACT TGTTTTTTCC CCCCAGATCG ATGGAGAAAT CCGGTCGATG
TTCAGGATGG AAAAGGATGG GAGCGAAAAG TGGCTCGCCT GGAGCAGGGA TGTCGATGTA
GCCCGTTCGC CTGTTGCAGG GGATACGCTT CAGCGATTCT ACTACACGGG CGACGGGGAG
CCGCGCACTT CCAACTTTGA AATGGCAACA GCGGGCGCCA ATGCTCATCC GTCCGCCTGC
TATGTACTTG GGGTTACACC CCCGGTCAGC GAACCGCTGG TGAGCGCGTC GGGTGGCAGC
GGAGTAGCGA CTTCCCGTGC CTATGTATAT ACATTTGTCA CGCAATGGGG GGAAGAGTCG
CAACCTTCCC CCGCTTCGAT AGTGACCAGC GGAAAGATAG ATGCAACCTG GATGATTTCA
AATCTGGACG CAGCACCCCC CAATTCCGGG GCAATTACTG CTGTTTCCAG GAATTCTCCA
GTTGCCGGGC AAATGGAAAT CAGCCTTGAT ACCGTCTTCG GTTTAAGAGC GCACGAGGAA
ATCCAGTTTG AATCGGTGTC GGGCATGACT GACTTGAACG GTCGATTCAT TCTGATGAGC
GTGGATCCGG TAACCAAAAA GGTGGCCATA TCCCTTTCTA CAGACCAGCT CTACGCCGGA
GGTGGGAGGT GGGAGCGGCA GGCTCCACAT AATACCGGGG GGATGAACAA GCGCATCTAC
CGGACGCTTA CCACGTCGTC AGGGACCGAA TATCGCTATG TCGCAACACT TTCCGGGATT
ACAAAAAACT ACAGCGATAC TGTCCCTGAC ACGGTTATTG CGCTGGGAGA AACATTGCCC
TCCACAAACT GGGAAATGCC CCCGGCAAAC ATGAGAGGCA TCGTTGTACT TGCGAATGGA
ATTGCCGCAG CATTCGCGGG CAATGAGGTG CTTTTCTCGG AACCGTTCAA ACCTTATGCC
TGGCCCACTT CGTATCGGCA AACATACGAC CAGGAAATTG TAGCCATCGC TGCAATGGGA
ACCACACTCG TCGGCATGAC CAGGGGCAAT CCCTTCACCC TGACCGGCGT TGAGCCCGTA
ACCATGGGCG GAGGAATGGA GAAGCTGGGA GTGGCGTGGC CCTGCATGTC GAAACGAGGA
GTAGCGAATT TTGCATTTGG CATCGGGTAT CCCGCTCCGC AAGGGATGGT AATGATCGGG
ACAAGCAGCG ATATTGTCAC AAAAGATTTG TTTACCCAGA AGGAGTGGTC CGAACTAAAC
CCCGATACCT TTATCGCGAC CTCTGCCGAT AACCGCTATT ACTGCGGCTA TTCGGCTGGC
GACAGCTCCC TGATGTTCGT GATCGATAAG GCGGAGGATG CATCCTTTAC AAAAATCAAC
CAGAACATCA GTTGCATCTG GACGGATCCC ATCACCGGCA AGCTCTACAT CGCCACAAAC
AAGAAAATCT ACGAATGGGA AGGGGATACG GGAACTAAGC TTTTCTATGA GTGGAAAAGC
AAACGGTTCG TTACTGCGTC ACCGGTTAAT TATGGTGCGG GGAAGATCGA TGCCGATTTT
AAAATGACGG AAGAAGAAAG AGCAGCGGCG CAATCCTCCT ATAAGGAGAC TATCGCTGCC
AACCAGACAT TGATCAGTTC TTATTCCATG GATGACGGAC TGGCAGATAC ATGCCTCGGT
GAATACGAGA TCGGGGGTGA TGCGACTCAG GATATTCCCC TTTTATCCAT AGATTCTCTG
CAATTTCAAT TATGGAGCGA TGGCGTGCCG AAATTTACCA AACAGGTCAA GAATAACAGG
GCATTTCGGC TTCCCGGTGG CTATAAGGCC GATAACGTAG AGTTTGTGCT ATCCGGCAAT
GTGAAGGTAA ACAGCGTCGT CCTGGCTGAA ACAATGGATG GATTGAAGCA GGCATAG
 
Protein sequence
MSAFRISGFS GLVPRLAKHL LSSNQAQTAT NCNLAAGDLR PRNAPLLVFS PQIDGEIRSM 
FRMEKDGSEK WLAWSRDVDV ARSPVAGDTL QRFYYTGDGE PRTSNFEMAT AGANAHPSAC
YVLGVTPPVS EPLVSASGGS GVATSRAYVY TFVTQWGEES QPSPASIVTS GKIDATWMIS
NLDAAPPNSG AITAVSRNSP VAGQMEISLD TVFGLRAHEE IQFESVSGMT DLNGRFILMS
VDPVTKKVAI SLSTDQLYAG GGRWERQAPH NTGGMNKRIY RTLTTSSGTE YRYVATLSGI
TKNYSDTVPD TVIALGETLP STNWEMPPAN MRGIVVLANG IAAAFAGNEV LFSEPFKPYA
WPTSYRQTYD QEIVAIAAMG TTLVGMTRGN PFTLTGVEPV TMGGGMEKLG VAWPCMSKRG
VANFAFGIGY PAPQGMVMIG TSSDIVTKDL FTQKEWSELN PDTFIATSAD NRYYCGYSAG
DSSLMFVIDK AEDASFTKIN QNISCIWTDP ITGKLYIATN KKIYEWEGDT GTKLFYEWKS
KRFVTASPVN YGAGKIDADF KMTEEERAAA QSSYKETIAA NQTLISSYSM DDGLADTCLG
EYEIGGDATQ DIPLLSIDSL QFQLWSDGVP KFTKQVKNNR AFRLPGGYKA DNVEFVLSGN
VKVNSVVLAE TMDGLKQA