Gene Nmul_A0395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0395 
Symbol 
ID3785388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp436563 
End bp437681 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content58% 
IMG OID637810471 
ProductDNA processing protein DprA, putative 
Protein accessionYP_411095 
Protein GI82701529 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCAA TACCTGTCCA GGTAATTTCG CAGCGGAATA TCGAACCCGA TATCGCATCC 
TGGCTCGCCC TGGACTTGAT CGATGGCCTG GGTGACGAGT CGATGAGGTG TTTGCTTGCG
ACTTTTGGCA GCCCCGCTGC AATTCTTTCG GCCAGCATGA CTTCATTGGA GCGCGTGGTT
AAAAGGAAGG TGGCGGACAA CATTATCGGG GGAGCTGACC CGCAAAAGCT GAATGCTTCG
CTCAAATGGC TGGAAGACCC GCAGAATTCC GTCATCACCC TGGCAGATCC GGATTATCCC
GCACTGTTAC TCCATATCCC CGATCCCCCG CCGCTTCTCT ATGTCAAAGG AAAGCGCGCT
CTGCTGAACG CGCCAATGCT TGCCATTGTC GGCAGCCGGA ATGCTACGCC CCAAGGTCTT
TCCAATGCAG AAGCCTTTGC CGAGGCGGCG AGCAATGCAG GATTTTCCAT CGCCAGCGGC
ATGGCTCTTG GCATTGACGC CGCAGCGCAT CGTGGAGGAC TGCGAGGAAG GGCCAGCAGT
ATTGCCGTGG TGGGTACCGG ACTGGATCTC GTTTATCCCG CAAGCCATCG CAAGCTGGCG
CATGAGTTGG CGGAAAGGGG CGCGCTTGTC TCCGAGTTTC CGTTGGGCAC GCCTCCCATC
GGCAGCAACT TTCCGCGTCG CAATCGCATC ATCAGTGGCC TGAGCAGGGG ATGCCTTGTG
GTCGAGGCCG CATTGCAGAG CGGTTCTCTT ATCACGGCGC GGCAGGCGTT GGAGCAGGGA
CGGGAAGTAT TCGCCATTCC GGGCTCCATC CACTCGCCCT TGTCCAGGGG ATGCCATGCA
CTCATCAAGC AGGGTGCCAA GCTGGTGGAA AGCGCAGGAG ATATTCTGGA GGAATTCGGT
TGCCCATCTG GCATCCCCAT CCTCGTTCCG GAAGGGGGTG AGGCTGCAAG AGAAGAATTT
TTGCTGTTGA AACACCTCAG CCATGACATC ATCGATGTCG ATACCCTCTG CCTGCGTAGT
GGCTTGACGG TAGAAACGGT ATCGGCCATG CTGTTGACGC TTGAACTGGA TGGCATAATC
GCCAGTCTTC CCGGCGGGCG TTACCAGCGG CTCCGATAG
 
Protein sequence
MSSIPVQVIS QRNIEPDIAS WLALDLIDGL GDESMRCLLA TFGSPAAILS ASMTSLERVV 
KRKVADNIIG GADPQKLNAS LKWLEDPQNS VITLADPDYP ALLLHIPDPP PLLYVKGKRA
LLNAPMLAIV GSRNATPQGL SNAEAFAEAA SNAGFSIASG MALGIDAAAH RGGLRGRASS
IAVVGTGLDL VYPASHRKLA HELAERGALV SEFPLGTPPI GSNFPRRNRI ISGLSRGCLV
VEAALQSGSL ITARQALEQG REVFAIPGSI HSPLSRGCHA LIKQGAKLVE SAGDILEEFG
CPSGIPILVP EGGEAAREEF LLLKHLSHDI IDVDTLCLRS GLTVETVSAM LLTLELDGII
ASLPGGRYQR LR