Gene Nmul_A2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2166 
Symbol 
ID3784406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2458554 
End bp2459555 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content59% 
IMG OID637812254 
Productpeptidase T2, asparaginase 2 
Protein accessionYP_412851 
Protein GI82703285 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1446] Asparaginase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATAC ACATGCGCGG AAGCGCGGGG AATGATTCGC CGATCCGGCT CGTGATTCAC 
GGAGGGGCCG GGAGCGTGAA ATCCGGGCAG GGGCCAGAAC GCGAGCAGGC TTACAGGGAG
GCACTCGCGC AATCGCTGAT CGCTGGCCAT GCGGTGCTTG CAGCAGGCGG CAACAGCATC
GATGCAGTGA TTGCGGCCAT TACAGTGATG GAGGATTGCC CCTTGTTCAA TGCCGGCAAG
GGCGCCGTGC TGACCCATGA TGGCCGCAAC GAACTGGAAG CTTCAATCAT GGAAGGGGCC
ACTCGCGCGG CAGGCGCCGT TGCGGGAGTG ACGACGATTC GCAATCCGAT TCGCGCCGCA
CATGCGGTAA TGACGAAAAG CGCCCACGTG ATGCTCATCG GGCAAGGTGC GGAAATTTTC
GCTGCAAAGC AGGATCTGGA AATCGTGGAT TCGTCCTATT TCTATACCCG GCACCGTTGG
AATCAGCTGC AGAAAGCCAT CGCTAAAGAA AGTATCCTGC TCGACCATGA TGCGGGTCTG
GATACATTGC CGGGTGAGGA TGAAAAACGT GGAACCGTTG GCGCAGTGGC GCTCGACTGC
CAGGGCAATC TGGCGGCCGG CACGTCGACC GGTGGGCTCA CGAACAAGCA CCCTGGCCGG
GTAGGAGACT CGTCCATTAT CGGGGCGGGA ACCTACGCGG ACAATCGCTC GGTTGCGGTA
TCCACGACTG GTACAGGCGA AATGTTCATT CGTACCGCTG CCGCTTTCAA CACAGCAGCG
CAGGTGCGAT TTCTGCATGC TCCGATTACT GCGGCAGCCG ATAACACGCT GGAGGAAATC
GCGGCGATAG GTGGAGATGG GGGCCTGATC GTCCTGGATG CGGACGGCAA CTATGCGATA
CGGTTCAACA CCGGCGCCAT GTTTCGCGGC ACAATTGGAG AGGATGGCAT AGCACGGACA
GGTATTTTTC CCGGGCCTGA AACACCAGCT CTGTCATGTT GA
 
Protein sequence
MTIHMRGSAG NDSPIRLVIH GGAGSVKSGQ GPEREQAYRE ALAQSLIAGH AVLAAGGNSI 
DAVIAAITVM EDCPLFNAGK GAVLTHDGRN ELEASIMEGA TRAAGAVAGV TTIRNPIRAA
HAVMTKSAHV MLIGQGAEIF AAKQDLEIVD SSYFYTRHRW NQLQKAIAKE SILLDHDAGL
DTLPGEDEKR GTVGAVALDC QGNLAAGTST GGLTNKHPGR VGDSSIIGAG TYADNRSVAV
STTGTGEMFI RTAAAFNTAA QVRFLHAPIT AAADNTLEEI AAIGGDGGLI VLDADGNYAI
RFNTGAMFRG TIGEDGIART GIFPGPETPA LSC