Gene Nmul_A0893 details

Gene Information

Locus tag: Nmul_A0893
Symbol:
ID: 3785935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1014174
End bp: 1015334
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 57%
IMG OID: 637810975
Product: putative AttH
Protein accession: YP_411588
Protein GI: 82702022
COG category: [R] General function prediction only
COG ID: [COG5621] Predicted secreted hydrolase
TIGRFAM ID:
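
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes 1-based, inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1014174, 1015334)
print(length_bp)                  # 1161
print(protein_length(length_bp))  # 386
```

Both results agree with the Gene Length and Protein Length values listed in the record.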


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.27416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGTCG TAAAGCGTAA TCCCGGATAT TTCACCCGTT CCCGCAAACA AAGGTGGCAC 
GATGCCGGAT TCAAACGGCT GACTGGAACC GCTTTCCTTG TTGCAGCTTT GTTCCTTGCT
TTCTTTACCA CCGGTCGTGT TCTGGCAGAA CGGCCACAAC TCTCACCGGT AGTTCGAAAC
GTGCCCCTTG TGTTCCCACG GGATTTCGGG GCGCATCCTG GTTTCAGAAA TGAGTGGTGG
TATGTAACCG GCTGGCTGGA AACACCCGAA AAAGAACCGC TTGGCTTCCA GATCACCTTT
TTCCGTGTGG CGACCGAACA CGATCGCGCC AACCCCAGCC GCTTTGCCCC CAGAGACCTC
ATCATTGCCC ACGCCGCCTT GTCTGATCCG GCAGCGGGTA AACTCCTGCA TGACCAGAAA
AGTGCACGGG ATGGTTTTGG TCTGGCATAT ACCACAGAGG ACAACACGAA CGTCAAACTG
GGCGATTGGT TTATGGTGCG GGAGGAAAAC GGGCGCTACC AGACACGCAT AAAAGCAGAC
CATTTCCGGC TCGATTTTTC GCTGACGCCC ACGCAATCCC CCATGCTGCA AGATCTCAAC
GGTTTTTCCA GGAAGGGGCC GCACCCGGAG CAGGCCAGCT ATTACTACAG TGAGCCTCAC
CTGCAGGTAA GTGGGAAAGT AACGCGCGAT GGCGAGGAAA TCACCGTGAA GGGCATCGCG
TGGCTCGACC ACGAGTGGTC TACTGCCTAC CTCGATCCGA AAGCGGTAGG ATGGGATTGG
GTTGGCGCCA ATCTTGACGA TGGGTCAGCC CTGATGGCGT TTCAGATCCG CGGCAAGGAC
GGCAGCAAGG TCTGGGCGTA TGCCGGGATC CGGAAGCCGT CGGGGCAGTT CACGCGCTTT
GAACCGGATC AGGTAAGCTT TGAACCGCAA CGCACCTGGC ATTCAACACG CACCAACACC
ACCTATCCAG TCAAAATACG AATCCGGACC GGCACTACCG GCTGGATCCT CACACCCCTG
ATGGACGACC AGGAACTTGA CTCGCGGCAA TCCACCGGCG CCGTCTATTG GGAAGGCGCG
GTGACCGTTA CCCGCGATGG CGAACCTGCG GGGCGCGGCT ACCTCGAACT GACCGGTTAC
GTGGAGCCGC TGAATCTCTA A
 
Protein sequence
MRVVKRNPGY FTRSRKQRWH DAGFKRLTGT AFLVAALFLA FFTTGRVLAE RPQLSPVVRN 
VPLVFPRDFG AHPGFRNEWW YVTGWLETPE KEPLGFQITF FRVATEHDRA NPSRFAPRDL
IIAHAALSDP AAGKLLHDQK SARDGFGLAY TTEDNTNVKL GDWFMVREEN GRYQTRIKAD
HFRLDFSLTP TQSPMLQDLN GFSRKGPHPE QASYYYSEPH LQVSGKVTRD GEEITVKGIA
WLDHEWSTAY LDPKAVGWDW VGANLDDGSA LMAFQIRGKD GSKVWAYAGI RKPSGQFTRF
EPDQVSFEPQ RTWHSTRTNT TYPVKIRIRT GTTGWILTPL MDDQELDSRQ STGAVYWEGA
VTVTRDGEPA GRGYLELTGY VEPLNL
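
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that, then compute a GC fraction and translate a coding sequence. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, and the codon table is the standard NCBI amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs in its permitted start codons).

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = "ATGCGAGTCG TAAAGCGTAA TCCCGGATAT"
print(translate(fragment))  # MRVVKRNPGY
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` allows the GC content and protein fields of the record to be checked in the same way.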