Gene Nmul_A2461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2461 
Symbol 
ID3786418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2809540 
End bp2810769 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content57% 
IMG OID637812552 
Producthypothetical protein 
Protein accessionYP_413142 
Protein GI82703576 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGATTG CGCTCCTTGT CGTCACTTCA CTTGTCATGC TGTTATTGCT TCCATCGGCA 
GGCAAGGCCG AAACGCACGT CCAGTTGCCG CAAACCAGCC TTACCCCTGG CGACGTCGCC
GTCATCATCA ATGACAGCGA CCCTCTTTCA GTGCAGATTG GACAGTACTA CCGCGAGGCA
CGCGGCATTC CGGCGTCCAA TATGATCCAT TTACAGTTTC CTCCCGGTCG AGCTGCACTG
TCGCGGGAGG AGTTTCAGGA ACTGAAAGCC GAGATAGACC AGAAGGTGCC GGCGCACGTC
CAGGCCTACG CAGTGGCCTG GACTCTTCCC TACCGGGTGG ATTGCATGTC GCTGACTTCC
GCGCTGGCTT TCGGGTTCGA CGAAAAGTAT TGTTCCTTTA ATTGCGGCGC AACTGCCGCC
AGTCCCTATT TCAATTCGCC GAGTCATCAT CCTGCCAGTG ACCACCAGTT GAGGCCGGCG
ATGATGTTGG CGGGGCTCGA TTTCGAACAG GTCAAATCGC TGATCGATCG TGGCGTGGCG
GCTGATCACA CTTTCCCCGC GGGACACGCC TATCTTGTCA TTACTTCTGA CAAGGCGCGA
AGCGTGCGCG CGAATTATTT CGAAATGACG GCGCAGCAAC TGAACGGTGT TTTTCCCATC
GAGATTCTTG AAACGGATGC TATTACCGAC CGGCACGACG TTCTATTTTA CTTCACGGGA
CTGGTCCGGG TACCGCAGCT CGACACGCTG GATTTTTTGC CGGGTGCATT GGCGGATCAT
TTGACCTCGG CGGGAGGGCA GCTTACAGGC TCAAGCCAGA TGAGCAGCCT GCAATGGCTT
GAAGCGGGAG CCACGGCCAG TTACGGCACA GTGGTGGAAC CTTGCAGCCA TATGCAGAAA
TTTCCTTTTC CCGCTATTGC CATGTTTCAT TACGCGCTTG GTGCAAGCGC CCTCGAAGCC
TACTGGAAGA GCGTGGCATG GCCGGGAGAA GGCGTCTTCA TAGGAGAGCC GCTCGCACGG
CCATTTGCGC CGGAGTTACG GGAAATCCGC ACCGGGCAGT TCGAATTACG CATTTTCTCG
CCACGCGAAA CACGGCTTCG AATCGAGCAG TCCCGTTCAG CCGCCGGACC GTTCAAGCCG
TCGCCCAGGC AGTTCCCGAT CCGGCGTGGG ATGAATCGAC TACGCTTCAA TTTTAATCAA
ACAGAAGGCT ACCTGCGGTT GAGATGGTAA
 
Protein sequence
MRIALLVVTS LVMLLLLPSA GKAETHVQLP QTSLTPGDVA VIINDSDPLS VQIGQYYREA 
RGIPASNMIH LQFPPGRAAL SREEFQELKA EIDQKVPAHV QAYAVAWTLP YRVDCMSLTS
ALAFGFDEKY CSFNCGATAA SPYFNSPSHH PASDHQLRPA MMLAGLDFEQ VKSLIDRGVA
ADHTFPAGHA YLVITSDKAR SVRANYFEMT AQQLNGVFPI EILETDAITD RHDVLFYFTG
LVRVPQLDTL DFLPGALADH LTSAGGQLTG SSQMSSLQWL EAGATASYGT VVEPCSHMQK
FPFPAIAMFH YALGASALEA YWKSVAWPGE GVFIGEPLAR PFAPELREIR TGQFELRIFS
PRETRLRIEQ SRSAAGPFKP SPRQFPIRRG MNRLRFNFNQ TEGYLRLRW