Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0395 |
Symbol | |
ID | 3785388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 436563 |
End bp | 437681 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637810471 |
Product | DNA processing protein DprA, putative |
Protein accession | YP_411095 |
Protein GI | 82701529 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCAA TACCTGTCCA GGTAATTTCG CAGCGGAATA TCGAACCCGA TATCGCATCC TGGCTCGCCC TGGACTTGAT CGATGGCCTG GGTGACGAGT CGATGAGGTG TTTGCTTGCG ACTTTTGGCA GCCCCGCTGC AATTCTTTCG GCCAGCATGA CTTCATTGGA GCGCGTGGTT AAAAGGAAGG TGGCGGACAA CATTATCGGG GGAGCTGACC CGCAAAAGCT GAATGCTTCG CTCAAATGGC TGGAAGACCC GCAGAATTCC GTCATCACCC TGGCAGATCC GGATTATCCC GCACTGTTAC TCCATATCCC CGATCCCCCG CCGCTTCTCT ATGTCAAAGG AAAGCGCGCT CTGCTGAACG CGCCAATGCT TGCCATTGTC GGCAGCCGGA ATGCTACGCC CCAAGGTCTT TCCAATGCAG AAGCCTTTGC CGAGGCGGCG AGCAATGCAG GATTTTCCAT CGCCAGCGGC ATGGCTCTTG GCATTGACGC CGCAGCGCAT CGTGGAGGAC TGCGAGGAAG GGCCAGCAGT ATTGCCGTGG TGGGTACCGG ACTGGATCTC GTTTATCCCG CAAGCCATCG CAAGCTGGCG CATGAGTTGG CGGAAAGGGG CGCGCTTGTC TCCGAGTTTC CGTTGGGCAC GCCTCCCATC GGCAGCAACT TTCCGCGTCG CAATCGCATC ATCAGTGGCC TGAGCAGGGG ATGCCTTGTG GTCGAGGCCG CATTGCAGAG CGGTTCTCTT ATCACGGCGC GGCAGGCGTT GGAGCAGGGA CGGGAAGTAT TCGCCATTCC GGGCTCCATC CACTCGCCCT TGTCCAGGGG ATGCCATGCA CTCATCAAGC AGGGTGCCAA GCTGGTGGAA AGCGCAGGAG ATATTCTGGA GGAATTCGGT TGCCCATCTG GCATCCCCAT CCTCGTTCCG GAAGGGGGTG AGGCTGCAAG AGAAGAATTT TTGCTGTTGA AACACCTCAG CCATGACATC ATCGATGTCG ATACCCTCTG CCTGCGTAGT GGCTTGACGG TAGAAACGGT ATCGGCCATG CTGTTGACGC TTGAACTGGA TGGCATAATC GCCAGTCTTC CCGGCGGGCG TTACCAGCGG CTCCGATAG
|
Protein sequence | MSSIPVQVIS QRNIEPDIAS WLALDLIDGL GDESMRCLLA TFGSPAAILS ASMTSLERVV KRKVADNIIG GADPQKLNAS LKWLEDPQNS VITLADPDYP ALLLHIPDPP PLLYVKGKRA LLNAPMLAIV GSRNATPQGL SNAEAFAEAA SNAGFSIASG MALGIDAAAH RGGLRGRASS IAVVGTGLDL VYPASHRKLA HELAERGALV SEFPLGTPPI GSNFPRRNRI ISGLSRGCLV VEAALQSGSL ITARQALEQG REVFAIPGSI HSPLSRGCHA LIKQGAKLVE SAGDILEEFG CPSGIPILVP EGGEAAREEF LLLKHLSHDI IDVDTLCLRS GLTVETVSAM LLTLELDGII ASLPGGRYQR LR
|
| |