Gene Nmul_A1698 details

Gene Information

Locus tag: Nmul_A1698
Symbol:
ID: 3784797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1937321
End bp: 1938556
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 57%
IMG OID: 637811785
Product: phosphoesterase, PA-phosphatase related
Protein accession: YP_412388
Protein GI: 82702822
COG category:
COG ID:
TIGRFAM ID:
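
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1937321
end_bp = 1938556
gene_length_bp = 1236
protein_length_aa = 411


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Here 1938556 - 1937321 + 1 = 1236 bp, and 1236 / 3 = 412 codons, of which the last is the stop codon, leaving 411 residues.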


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00849599
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCAAG ATGTAAGGAA AAATAGGTTA CTGGTACTGG CAGTAGTCGT TATATCTATT 
CTTGTCAGTT CCGCCGCGCG AGCCGACGCA GTGACGCACT GGAACCGGGT GGCAGGGGAT
ATTATCGTGG ACTCGGGATT GGGCCCGCTC CCGGCAGATC GTGCTCTCGC AATCGTGCAG
ACTGCGGTAT ATGAGGCAAC AAACGCGATT ACCCAGCAAT ATCCGGCCAG CGATCTCAAG
CTGAAAGGTA AGCCGGAAGC CTCGGTTGAG GCCGCCATTG CAGCGGCAAA TCGCGCTACG
CTTACAGCCC TGGTACCGGT GCAACGGACA GCTATCGATA CTGCCTATCG CCAAGCGCTT
GCCGCCATCC CGGATGACAT CGCAAGAAGC GATGGCCTTG CGATTGGCGA AAAAGCCGCA
GCGGGAATTC TGTCGCAGCG GGCACAAGAC GGTGCCGATG CAGGAGAATC ATACCGTCCG
CACACCTCTC CCGGACTCTA TGTGCCTACT GTAATACCGG AAGCGCCGCA CTGGTTGCAA
ATCAGGCCTT GGCTTATGGA TAACCCGGCC CAATTTCGTC CCGGCCCTCC TCCCCAACTG
GAAAGCGAGC TATGGGCACG CGACTACAAT GAAGTCAAGG CGCTGGGCGG GAAGCACAGC
CAGCGCACCG CTGCTCAGAC CGAAATTGCC CGCTTTTGGG AAGAAGTAAT GCCTCCGATC
TACCATGGAG TAATACGCTC GGTTGCCGAA GCCCCCGGAA GGGAGATCAC TCGGAATGCG
CGTCTCTTTG CGGCGGCAAC CCAGGCTTCC ACCGACGCGC TTATAGCGGT ATTTGACGCC
AAGTATCATT ACGGCTTCTG GCGCCCGGTC ACTGCCATTC GCAATGGCGA TGTTGATGGA
AACAATGCGA CAGAACGGGA TCCTTCATGG CTTCCCTTTA TCGACACCCC CATGCATCCT
GAATACCCCT GCGCCCACTG CATCGTAGCA GGCGCGGTAG GAACAGTACT GAAAGCTGAG
ATTGGGGCAG ATCCCATACC GCCCCTGGCT ACTACCAGCC GGGCCGCGGG CGGCGTGATG
CGCAGTTGGA AGAATATCGA TGAGATCATC CAGGAAGTGG CCAATGCGCG CATATATGAT
GGAGTACACT ACCGCAACTC CGGTGAGGTA GGCACCGCCA TGGGCAGACG AATTGCCGAG
CTGGCAGTTA TGAAATATTT CCGCACTGAC CAGTAA
 
Protein sequence
MTQDVRKNRL LVLAVVVISI LVSSAARADA VTHWNRVAGD IIVDSGLGPL PADRALAIVQ 
TAVYEATNAI TQQYPASDLK LKGKPEASVE AAIAAANRAT LTALVPVQRT AIDTAYRQAL
AAIPDDIARS DGLAIGEKAA AGILSQRAQD GADAGESYRP HTSPGLYVPT VIPEAPHWLQ
IRPWLMDNPA QFRPGPPPQL ESELWARDYN EVKALGGKHS QRTAAQTEIA RFWEEVMPPI
YHGVIRSVAE APGREITRNA RLFAAATQAS TDALIAVFDA KYHYGFWRPV TAIRNGDVDG
NNATERDPSW LPFIDTPMHP EYPCAHCIVA GAVGTVLKAE IGADPIPPLA TTSRAAGGVM
RSWKNIDEII QEVANARIYD GVHYRNSGEV GTAMGRRIAE LAVMKYFRTD Q
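
The gene and protein sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how such a listing could be cleaned and translated. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for non-initiation codons. `CODON_TABLE`, `clean` and `translate` are illustrative names, not part of the record or of any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACTCAAG ATGTAAGGAA AAATAGGTTA"))  # MTQDVRKNRL
```

The printed peptide matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein listing would extend the same check to the whole record.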