Gene Nmul_A2683 details


Gene Information

Locus tag: Nmul_A2683
Symbol:
ID: 3785045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 3082572
End bp: 3084536
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 56%
IMG OID: 637812773
Product: protein-disulfide reductase
Protein accession: YP_413362
Protein GI: 82703796
COG category: [C] Energy production and conversion
    [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4232] Thiol:disulfide interchange protein
TIGRFAM ID:
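
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions match the numbers shown here, but the record itself does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3082572
END_BP = 3084536


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count, assuming the CDS ends with one uncounted stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1965, matches "Gene Length"
print(protein_length(length_bp))  # 654, matches "Protein Length"
```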


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCTGA ACCGATATAG CCGATATTCC CTGCTGTTTC TCTGCCTTTT CAGCACCAGC 
GTCTTTGCGG AAGAAGGAAA AACTTCCGGA TTTCTTTCGG GTTTGCAGCA ACTGCGCTCG
ACCTTGGGCC AGGATCAGGA GGAGCAGGAA TTGCTGCCCC CTGACGAGGC TTTCAAGCTG
AAGGTTAAGG CACGGGACGC AAATACGCTT GTGGCGCAGT TCGAGCCGGC GAAGAACTAT
TATCTTTACA AGGATAAGGT TGCGTTCAAA CCGCAGACCC CCGGAACCAC TGTCGAAAAC
ATCTCTCTCC CGCAGGGAGA GATGAAAAGC GATGTGACCT TTGGGGAGGT CGAGGTTTTT
AACGAGCCGT TTGAAGCATT GATTTCCCTG AAGCGGACCG CGCCCTCTGG CGACAAGCTG
ACGCTTGTTG CCACTTATCA GGGTTGCAAC GAGCCCATCG GTGTCTGCTA TGCGCCCATA
AGCAAGGTAA TCGATCTGAC GCTGCCGGTT GCAAAGGCAG CGGCCGGAGC GGTCGCCAAT
GCAATGAGCG CCGATGCATC CGCCGCAACA GGTTCCGCTG GAGAACGCCT TGATGCAACT
GACGAGTTGT TCCAGTCGAA GGATGGTTCG GCCGCCATCG AGACGGAATC GTTGGAAATC
GAGAGAATGT TCCAGACCGG CAACTTCTGG CTGATTCTAA CCGGCTTCTT CGGTATCGGC
CTGCTCCTCT CCTTTACTCC CTGTGTATTT CCCATGTTCC CCATTCTATC CGGCATTATC
GCCAAGGGGG GGCAGCACGT TACCAAGCAG CGCGGATTCA TCCTGGCCCT GGCCTACGTG
CTCGGAATGG CCATTACCTA TGCTATCGCC GGGGTCGCAG CGGGACTCTC CGGAGCAATG
CTGTCGGCGA CCCTGCAGAA TGCCTGGGTA CTGGGCGCGT TCGCGCTGAT CTTTGTAACA
CTCGCATTTT CCATGTTCGG GTTCTATGAG CTGCAGCTCC CCACCTTTTT CCAGAGCAAG
ATTTCGGAAG AGGCTGGACA TCTCAAAGGG GGACACCTTA CGAGTGTATT CGGCATGGGC
GCCTTGTCCG CACTGATTGT CGGACCGTGT GTGGCAGCGC CTCTTGCGGG CGCGCTGCTG
TACATCAGCC AGACACGGGA TGTAGTGCTC GGAGGCTCGG CGCTCTTTGC CATGGCGCTG
GGCATGGGCC TGCCCTTGCT GTTGCTGGGC GCCTCCGCCG GCGCCCTCCT GCCGAAAGCA
GGTGCATGGA TGGAAGGAGT CAAGCAGTCG TTCGGGGTAT TGCTGCTGGG GGTGGCAATA
TGGCTGATAT CCCCGGTGAT TCCTGCTGTG GTGCACATGC TGCTATGGGC CGCCCTGCTG
ATCGTGTCGG CCATTTACCT GCATGCGGTT GACCCGCTGC GGCCGGATGC TTCCGGTCCC
CAGAAATTCC TCAAGGGGAT AGGCATGATT GCCCTGCTGA CGGGCATCGC GTTGCTCGTC
GGCGTGTTTT CCGGCAGCCG CGATATTCTG CAACCTCTTT CAAAGCTGAA TATTTCAGCG
GCTGGCATGG AAGGGGCAAA AAAGGACGGT CTCAACTCAA ACGAGCATTT GCCATTCCAG
AGAGTAAAGT CGGTAGCGGA ACTGAATCAG CAGATCCTCC AGTCGAGAAA CAAATACGTA
ATGCTGGATT TTTATGCAGA CTGGTGCGTC TCCTGCAAGG AAATGGAGCG TTTCACCTTT
ACCGACCCAG CTGTCCAGGC ACGATTGAAA GACGTTGTGC TGCTGCAGGT TGATGTGACA
GCAGGTACGC CCGAGGACAT GGAGCTTCTC AAGCGCTTCA AGCTGTTCGG GCCGCCGGCT
ATTCTCTTCA TGGACAAAGA GGGACGTCAA GTTCCCAACG TCAAAATCAT CGGTTATCAG
GATACCCCTG CTTTTCTGAA GATACTCAAC GCGGTGCTGA TCTAG
 
Protein sequence
MPLNRYSRYS LLFLCLFSTS VFAEEGKTSG FLSGLQQLRS TLGQDQEEQE LLPPDEAFKL 
KVKARDANTL VAQFEPAKNY YLYKDKVAFK PQTPGTTVEN ISLPQGEMKS DVTFGEVEVF
NEPFEALISL KRTAPSGDKL TLVATYQGCN EPIGVCYAPI SKVIDLTLPV AKAAAGAVAN
AMSADASAAT GSAGERLDAT DELFQSKDGS AAIETESLEI ERMFQTGNFW LILTGFFGIG
LLLSFTPCVF PMFPILSGII AKGGQHVTKQ RGFILALAYV LGMAITYAIA GVAAGLSGAM
LSATLQNAWV LGAFALIFVT LAFSMFGFYE LQLPTFFQSK ISEEAGHLKG GHLTSVFGMG
ALSALIVGPC VAAPLAGALL YISQTRDVVL GGSALFAMAL GMGLPLLLLG ASAGALLPKA
GAWMEGVKQS FGVLLLGVAI WLISPVIPAV VHMLLWAALL IVSAIYLHAV DPLRPDASGP
QKFLKGIGMI ALLTGIALLV GVFSGSRDIL QPLSKLNISA AGMEGAKKDG LNSNEHLPFQ
RVKSVAELNQ QILQSRNKYV MLDFYADWCV SCKEMERFTF TDPAVQARLK DVVLLQVDVT
AGTPEDMELL KRFKLFGPPA ILFMDKEGRQ VPNVKIIGYQ DTPAFLKILN AVLI
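
The two sequence blocks above can be related to each other with a few lines of Python. This is a minimal sketch, not the method the database used: it strips the space-separated 10-character groups, computes GC content as a percentage, and translates with the codon assignments of translation table 11, which are the same as the standard code. It does not handle the alternative start codons that table 11 allows, and unknown codons are written as X. All function names are illustrative. Only the first line of the gene sequence is included here. Paste the full block to check the whole protein and the reported GC content.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna, to_stop=True):
    """Translate complete codons in frame 1, optionally stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence block from this record.
excerpt = clean_sequence(
    "ATGCCCCTGA ACCGATATAG CCGATATTCC CTGCTGTTTC TCTGCCTTTT CAGCACCAGC"
)
print(translate(excerpt))  # MPLNRYSRYSLLFLCLFSTS, the first 20 residues of the protein
print(round(gc_content(excerpt), 1))
```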