Gene Nmul_A2241 details

Gene Information

Locus tag: Nmul_A2241
Symbol:
ID: 3784942
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2546609
End bp: 2547991
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 54%
IMG OID: 637812329
Product: adenylosuccinate lyase
Protein accession: YP_412925
Protein GI: 82703359
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
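
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (which is consistent with the values in this record) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2546609, 2547991)
print(length)                     # 1383
print(protein_length_aa(length))  # 460
```

Both results agree with the Gene Length and Protein Length fields listed for Nmul_A2241.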


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0280577
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTCAT CCTTTCTCAC CGCACTCTCT CCTCTTGATG GCCGCTATCA CGGCAAAGTC 
GATGCACTGA GACCCTATTT CAGCGAACTC GGGCTGATCC GTTACCGCGT CCAGCTTGAG
ATCGAATGGC TCAAGGCATT GAGCCGGGCT TCCGCCATTG CTGAAGCACT GCCGTTATCT
CCGGATACGC TTGCACAGCT TGATGCTCTG ATAACGGATT TTTCCGAAAG GGATGGCGAA
GCGGTCAAAC TCATCGAGGC GCGCACCAAT CATGACGTGA AAGCCGTGGA ATACTGGCTT
CGCAAGCAGT TAACGGAAAA TGCCGAGATC GGCAGAATCG AACAATTCAT TCATTTTGCC
TGCACTTCAG AGGACATCAA CAATCTTTCC CATGGATTGA TGCTGATGCA CAGCCGCGAC
GACATCATGT TGCCGGCGCT GGATAACATC ATTGGCAGGC TCGTCCGGCT TGCGCACGAG
CTGGCCGCAG TGCCCATGCT TGCACGCACG CACGGGCAGG CTGCCACGCC GACCACGGTG
GGCAAGGAGC TCGCCAATTT CGCTTATCGT CTGCAACGGG GACGCCGGCG CTTGGCGCAG
GTGGCTATTC TCGGAAAAAT CAATGGAGCT GTGGGTAATT ACAACGCCCA CCTGGCGGCC
TATCCCGACT TCGGATGGGA GAAATTCGCG CAGGATTTTG TGGAAAAGCT CGGCCTGCAA
TTCAATCCTT ATACTACCCA GATTGAACCG CACGACACCG TAGCTGAACT GTTCGATGCC
TATGCCAGGA TCAATACGAT TCTGCTGGAT TTCAACCGGG ATATATGGGG ATACATCTCG
CTAGCCTACT TCAAGCAAAA AACCCGAAAA GATGAAGTCG GTTCCTCAAC CATGCCCCAC
AAGGTCAATC CGATAGATTT TGAAAATTCC GAAGGTAATC TGGGGATTGC CAACACGCTC
CTGCGGCACT TGAGCGAGAA GCTGCCGATA TCGCGCTGGC AGCGAGACCT TACCGACTCC
ACTGCACTGC GTAACATGGG TGTCGCATTA GGCCATACTT TATTGGCATA TGACTCCTGC
AGCAGGGGCC TGGACAAACT GGAAATCAAC CCCGGCCGTC TGGAGGAGGA TTTGAGAAAT
GCCTGGGAAG TGCTGGCCGA GCCCATTCAG ACGGTGATGC GTCGCCACGG CATGCCGGAT
TCATACGAGC GGTTGAAAGA ACTGACCCGG GGCAAGGGCG GTATCACCCA AGACGCGCTC
CATCAGTTTA TTGACAGTCT CGCCCTTCCT GATGTGGAAA AAAAACGATT GCGTGAAATG
AGGCCGGAAA CATACCTGGG GAATGCTGCT GCATTAGCGA GAAAAATCAG TACGGAATAC
TGA
 
Protein sequence
MNSSFLTALS PLDGRYHGKV DALRPYFSEL GLIRYRVQLE IEWLKALSRA SAIAEALPLS 
PDTLAQLDAL ITDFSERDGE AVKLIEARTN HDVKAVEYWL RKQLTENAEI GRIEQFIHFA
CTSEDINNLS HGLMLMHSRD DIMLPALDNI IGRLVRLAHE LAAVPMLART HGQAATPTTV
GKELANFAYR LQRGRRRLAQ VAILGKINGA VGNYNAHLAA YPDFGWEKFA QDFVEKLGLQ
FNPYTTQIEP HDTVAELFDA YARINTILLD FNRDIWGYIS LAYFKQKTRK DEVGSSTMPH
KVNPIDFENS EGNLGIANTL LRHLSEKLPI SRWQRDLTDS TALRNMGVAL GHTLLAYDSC
SRGLDKLEIN PGRLEEDLRN AWEVLAEPIQ TVMRRHGMPD SYERLKELTR GKGGITQDAL
HQFIDSLALP DVEKKRLREM RPETYLGNAA ALARKISTEY
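
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before the sequence can be checked against the record fields. The following is a rough sketch of that cleanup, together with a GC fraction calculation and a basic stop codon check. The helper names are hypothetical, and the demo input is a shortened, illustrative string built from the first three blocks and the final codon of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is TAA, TAG or TGA."""
    return len(seq) % 3 == 0 and seq[-3:] in {"TAA", "TAG", "TGA"}


# Shortened illustrative input, not the complete gene sequence.
demo = clean_sequence("ATGAATTCAT CCTTTCTCAC CGCACTCTCT\nTGA")
print(len(demo))
print(gc_fraction(demo))
print(ends_with_stop(demo))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field, and `gc_fraction` with the GC content field.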