Gene Nmul_A0618 details

Gene Information

Locus tag: Nmul_A0618
Symbol:
ID: 3784414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 700464
End bp: 701831
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 53%
IMG OID: 637810700
Product: putative cytochrome c1 signal peptide protein
Protein accession: YP_411317
Protein GI: 82701751
COG category:
COG ID:
TIGRFAM ID:
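
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the listed numbers) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 700464
end_bp = 701831
gene_length_bp = 1368
protein_length_aa = 455


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1368
print(expected_protein_length(gene_length_bp))  # 455
```

Both results agree with the Gene Length and Protein Length fields of the record.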


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.84464
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGAAA TGCTCATCTC CAGAAGCGTT ATCTCTCCCG TTCTTATCCT GGCGGGAATG 
CTCCTTACGC TATCAAGCGC CGAAGTCAAA GCGGTGCCCA GCTTTGCGCG CCAGACCGGC
ATGCCGTGCA GCACCTGCCA TGTGCAGGCT TTTGGACCGC TCCTTACGTC TATCGGACGC
AATTTCAAGC TATCCGGTTA CACCGATGTG AACCCCGATA GAACCAAATT CATCCCGATC
ACTGGCATGA TACGCGGCTC GTTTACCCAT ACCAATAACG GCCAGCTTGG CGGCGCGGCT
GACCGCTTCG GCCCCAACAA TAACGCAACT ATCGACGAAG CCTCGATTTT TTATGCTGGC
CGCATTACGT CCAAAATCGG CGCTTTCGCG CAGGGGACCT ATGATGGCGT AAGCAATACG
GGAGCGCTCG ATAACACCGA TATTCGTTTT GCCAATGGGG CCGACCTGGC CGGTAATCGC
CTCGTTTATG GCATTTCTGT GAATAATAAC CCTACCGTTC AGGATCTCTG GAACACCACT
CCTGCCTGGG GTTTCCCTTG GGCGTCATCC CCACTTGCCC CCACCCCTGC TGCGGGACTA
TTCATTGAGT CTCTTGGCAG CCAAGTCGTC GGTGCAACGG TCTATACGAT GTGGAATGAT
ATGTTATACG TTGAGGGAGG GGGCTACACC AGCCTTCCGC GGAACATTCA GCAAGGGATT
GGAACGTTTG ACGCGGGACA AAACCGGATC GATGGCGGTG CGCCTTACGG GCGGGTGGCG
CTACAAAACA ACTGGCAAGG CCATTACGGT GCCATCGGCT CCTTCGGCAT GAGAGCAAAT
GCCAATCCAC AGCGGATTCA AGGTGCGAGT ACCGACCAGT ACACCGATTT CGGGTTTGAC
GCCACCTACC AATATCTGGC GAATCCCAGG CATATCCTTG AACTCAATGC CACTTACATC
CGGGAGCACC GCGACCTGAA TGCCAGCGTG GGATTGGGAT TTGCAGAAAA AAGACATGGA
AGCCTGGACG TGGTTCGCGT CAGAAGCGGA TATACTTTCC TCCAGACTTA TTCGCTCAAT
CTAGCGTATG CCCAGACCTC GGGCACACGG GATAATGTCA TATACTCTCC CGATCCTATT
GATGGCAGTT TATCAGGCAA GCCAAATAGT CAGGCATTTA CGGTTGAGGT GAGCTATATC
CCATTCGGCA AGAGTACTTC CGTCCTGTCT ACGTTTGCGA ATCTCAAACT TACGGCGCAG
TACATTCATT ACTTCCAGTT CAACGGAGGT TTTCGCAATT ATGACGGCTT CTCCCGCAAT
GCGCCTGGTA ACGATACGGT ATATCTGAAC GGCTGGATGG CGTTCTAA
 
Protein sequence
MREMLISRSV ISPVLILAGM LLTLSSAEVK AVPSFARQTG MPCSTCHVQA FGPLLTSIGR 
NFKLSGYTDV NPDRTKFIPI TGMIRGSFTH TNNGQLGGAA DRFGPNNNAT IDEASIFYAG
RITSKIGAFA QGTYDGVSNT GALDNTDIRF ANGADLAGNR LVYGISVNNN PTVQDLWNTT
PAWGFPWASS PLAPTPAAGL FIESLGSQVV GATVYTMWND MLYVEGGGYT SLPRNIQQGI
GTFDAGQNRI DGGAPYGRVA LQNNWQGHYG AIGSFGMRAN ANPQRIQGAS TDQYTDFGFD
ATYQYLANPR HILELNATYI REHRDLNASV GLGFAEKRHG SLDVVRVRSG YTFLQTYSLN
LAYAQTSGTR DNVIYSPDPI DGSLSGKPNS QAFTVEVSYI PFGKSTSVLS TFANLKLTAQ
YIHYFQFNGG FRNYDGFSRN APGNDTVYLN GWMAF
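
The sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before any computation. The sketch below shows one possible way to strip the layout and compute a GC percentage. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the gene sequence is used as sample input, so the GC value it yields is not the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as printed above.
first_line = (
    "ATGCGGGAAA TGCTCATCTC CAGAAGCGTT ATCTCTCCCG TTCTTATCCT GGCGGGAATG"
)

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

Applying the same two helpers to the full gene sequence should give a length that can be compared with the Gene Length field and a percentage that can be compared with the GC content field.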