Gene Nmul_A1040 details


Gene Information

Locus tag: Nmul_A1040
Symbol:
ID: 3785167
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1202564
End bp: 1204138
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 54%
IMG OID: 637811124
Product: arginine decarboxylase
Protein accession: YP_411735
Protein GI: 82702169
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1982] Arginine/lysine/ornithine decarboxylases
TIGRFAM ID:
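
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length counts one terminal stop codon; both assumptions are consistent with the values listed but are not stated by the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1202564, 1204138)
print(length)                     # 1575
print(protein_length_aa(length))  # 524
```

Both results agree with the Gene Length (1575 bp) and Protein Length (524 aa) fields.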


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACCGGA TTTTGAATGC CCAGCTTCAG GAAAAAGCGC GCACGCCTTT TTACGATCAA 
CTCAAAAGTT ATGTATTGAT GGCAAAGGAT GCCTGGCATA CACCTGGGCA TTCTTCAGGC
GATTCGTTGC GGGACAGTCC CTGGGCCAGC GATTTCTACC AGTTTATCGG CGAGCATATT
TTTCGCGCAG ATCTGTCGGT GTCGGTGCCC ATGCTCGATT CGCTCATGGA ACCTTCCGGG
GTCATTGCCG AAGCGCAGAA GATTGCGGCA AAGGCGTTTG GCGCGCGCCG CACTTTTTTT
GCCACAAATG GCACTTCCAC CGCCAACAAG GTGATATTCC AGACGTTGCT CGCTCCTGGC
GAAAAGCTGC TGCTGGATCG GAACTGCCAT AAATCGGTTC ATCACGGAGT CGTGCTATCC
GGCGCCCATC CCATCTATCT CAACTCCTCG GTAAACAAGA AATTCGGGGT TTATGGGCCG
GTGCCCAAGC AGACACTGTT CAGGGCAATC GAAGAACATC CCGATGCCCA GGCGCTCATA
CTCACGAGTT GCACCTATGA TGGTTTCCGC TATGACCTGC CTCCGATCAT AGAGGCCGCG
CATGCCAAAG GCATCAAGGT GATCATCGAT GAAGCCTGGT ACGGGTTCGC CCGCTTTCAT
CCGGCTTTCC GTCCCACCGC CCTGGAAGCA GGCGCAGATT ATGCTACTCA AAGTACACAC
AAGGTGCTGT CGGCTTTTTC CCAGTCCAGC ATGATTCATA TCAATGATCC CGAATTCAAC
GAGCATCTGT TTCGGGAAAA TTTCAACATG CATACTTCCA CCAGCCCGCA GTACAGCATG
ATTGCAAGTC TCGACGTGGC GCGCAAACAG GTGGTGATGG AGGGATACAA GCTATTGTCG
CGCACGCTGG AGCTGGCGAA GGAAGTACGT GAGCAAATCA ATTCGACCGG CGTGTTTCGC
GTACTGGAAC TGACGGATCT GCTGCCTGAC GAGGTGAAGA ACGACAATAT CCAGCTCGAT
TCGACCAAGG TCACCGTCGA TATTTCGCAT TGTGGCTTTA CGGTGGAAGA TTTGGTCCGG
GAACTGTTCG AGCGATATAA CATTCAGGTG GAAAAATCCA CTTTCAATAC GCTCACTCTG
CTGCTGACCA TCGGTACCAC GCGCAGCAAG GTATCGCGCC TTTACGATGC TCTCATGCGC
ATCGCACGCG AGGGCAGGGC GCCCCGCAGA CTCTACCAGA TCCCGGAGCT TCCGGGATTT
ACTGAATTGA AGTATCTGCC GCGGGATGCC TTTTACTGCG GCGGCGAGAT CGTTCCGTTG
CTCGACGAGC AGGAGCGGAT AAATGATAGC CTGAAAGGGA AGGTCTGCGC GGATCAGATC
ACGCCTTACC CCCCGGGTAT TCCGGTCCTG GTGCCAGGCC AGACCATCAC GTCCGGGGTG
GTGCAATATC TAGTCAGCAT GCTACGATCG CAGAAACGGG TGGAAGTGCA CGGGATCGTT
TATGACGGCT ATCTGCCGTG TTTGAGGCTG TTGAGCGACG TCGAGGAAAA GAGCTTGAAA
AAGCTTGCAA AATAG
 
Protein sequence
MYRILNAQLQ EKARTPFYDQ LKSYVLMAKD AWHTPGHSSG DSLRDSPWAS DFYQFIGEHI 
FRADLSVSVP MLDSLMEPSG VIAEAQKIAA KAFGARRTFF ATNGTSTANK VIFQTLLAPG
EKLLLDRNCH KSVHHGVVLS GAHPIYLNSS VNKKFGVYGP VPKQTLFRAI EEHPDAQALI
LTSCTYDGFR YDLPPIIEAA HAKGIKVIID EAWYGFARFH PAFRPTALEA GADYATQSTH
KVLSAFSQSS MIHINDPEFN EHLFRENFNM HTSTSPQYSM IASLDVARKQ VVMEGYKLLS
RTLELAKEVR EQINSTGVFR VLELTDLLPD EVKNDNIQLD STKVTVDISH CGFTVEDLVR
ELFERYNIQV EKSTFNTLTL LLTIGTTRSK VSRLYDALMR IAREGRAPRR LYQIPELPGF
TELKYLPRDA FYCGGEIVPL LDEQERINDS LKGKVCADQI TPYPPGIPVL VPGQTITSGV
VQYLVSMLRS QKRVEVHGIV YDGYLPCLRL LSDVEEKSLK KLAK
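
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the method the database used: it assumes the sequence is read in frame from its first base, strips the spacing used in the 10-base blocks, and stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code, so the standard codon table is used here; alternative initiation codons permitted by table 11 are not handled, which does not matter for this gene because it starts with ATG. The names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = "ATGTACCGGA TTTTGAATGC CCAGCTTCAG GAAAAAGCGC GCACGCCTTT TTACGATCAA"
print(translate(first_row))  # MYRILNAQLQEKARTPFYDQ
```

The 20 residues printed match the first two blocks of the protein sequence (MYRILNAQLQ EKARTPFYDQ). Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.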