Gene Information |
Locus tag | Nmul_A1698 |
Symbol | |
ID | 3784797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1937321 |
End bp | 1938556 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637811785 |
Product | phosphoesterase, PA-phosphatase related |
Protein accession | YP_412388 |
Protein GI | 82702822 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
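The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1937321
END_BP = 1938556


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1236
print(protein_length(length_bp))  # 411
```

Both results agree with the Gene Length (1236 bp) and Protein Length (411 aa) fields listed in the table.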
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00849599 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAG ATGTAAGGAA AAATAGGTTA CTGGTACTGG CAGTAGTCGT TATATCTATT CTTGTCAGTT CCGCCGCGCG AGCCGACGCA GTGACGCACT GGAACCGGGT GGCAGGGGAT ATTATCGTGG ACTCGGGATT GGGCCCGCTC CCGGCAGATC GTGCTCTCGC AATCGTGCAG ACTGCGGTAT ATGAGGCAAC AAACGCGATT ACCCAGCAAT ATCCGGCCAG CGATCTCAAG CTGAAAGGTA AGCCGGAAGC CTCGGTTGAG GCCGCCATTG CAGCGGCAAA TCGCGCTACG CTTACAGCCC TGGTACCGGT GCAACGGACA GCTATCGATA CTGCCTATCG CCAAGCGCTT GCCGCCATCC CGGATGACAT CGCAAGAAGC GATGGCCTTG CGATTGGCGA AAAAGCCGCA GCGGGAATTC TGTCGCAGCG GGCACAAGAC GGTGCCGATG CAGGAGAATC ATACCGTCCG CACACCTCTC CCGGACTCTA TGTGCCTACT GTAATACCGG AAGCGCCGCA CTGGTTGCAA ATCAGGCCTT GGCTTATGGA TAACCCGGCC CAATTTCGTC CCGGCCCTCC TCCCCAACTG GAAAGCGAGC TATGGGCACG CGACTACAAT GAAGTCAAGG CGCTGGGCGG GAAGCACAGC CAGCGCACCG CTGCTCAGAC CGAAATTGCC CGCTTTTGGG AAGAAGTAAT GCCTCCGATC TACCATGGAG TAATACGCTC GGTTGCCGAA GCCCCCGGAA GGGAGATCAC TCGGAATGCG CGTCTCTTTG CGGCGGCAAC CCAGGCTTCC ACCGACGCGC TTATAGCGGT ATTTGACGCC AAGTATCATT ACGGCTTCTG GCGCCCGGTC ACTGCCATTC GCAATGGCGA TGTTGATGGA AACAATGCGA CAGAACGGGA TCCTTCATGG CTTCCCTTTA TCGACACCCC CATGCATCCT GAATACCCCT GCGCCCACTG CATCGTAGCA GGCGCGGTAG GAACAGTACT GAAAGCTGAG ATTGGGGCAG ATCCCATACC GCCCCTGGCT ACTACCAGCC GGGCCGCGGG CGGCGTGATG CGCAGTTGGA AGAATATCGA TGAGATCATC CAGGAAGTGG CCAATGCGCG CATATATGAT GGAGTACACT ACCGCAACTC CGGTGAGGTA GGCACCGCCA TGGGCAGACG AATTGCCGAG CTGGCAGTTA TGAAATATTT CCGCACTGAC CAGTAA
|
Protein sequence | MTQDVRKNRL LVLAVVVISI LVSSAARADA VTHWNRVAGD IIVDSGLGPL PADRALAIVQ TAVYEATNAI TQQYPASDLK LKGKPEASVE AAIAAANRAT LTALVPVQRT AIDTAYRQAL AAIPDDIARS DGLAIGEKAA AGILSQRAQD GADAGESYRP HTSPGLYVPT VIPEAPHWLQ IRPWLMDNPA QFRPGPPPQL ESELWARDYN EVKALGGKHS QRTAAQTEIA RFWEEVMPPI YHGVIRSVAE APGREITRNA RLFAAATQAS TDALIAVFDA KYHYGFWRPV TAIRNGDVDG NNATERDPSW LPFIDTPMHP EYPCAHCIVA GAVGTVLKAE IGADPIPPLA TTSRAAGGVM RSWKNIDEII QEVANARIYD GVHYRNSGEV GTAMGRRIAE LAVMKYFRTD Q
|
| |
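The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, assuming the codon assignments of translation table 11 (the table named in the record) and ignoring the special handling of alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (translation table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Ambiguous bases (for example N) are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("ATGACTCAAG ATGTAAGGAA AAATAGGTTA"))  # MTQDVRKNRL
# Last six bases of the gene sequence: one codon plus the TAA stop.
print(translate("CAGTAA"))  # Q
```

The first call reproduces the first ten residues of the listed protein sequence, and the second shows the final glutamine followed by the stop codon. Passing the full gene sequence to `translate` should give the complete protein under the same assumptions.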