Gene Nmul_A1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1804 
Symbol 
ID3786355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2062799 
End bp2064121 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content55% 
IMG OID637811890 
Productferredoxin-dependent glutamate synthase 
Protein accessionYP_412493 
Protein GI82702927 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0069] Glutamate synthase domain 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.922442 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAAG AATTGAAAAA GAGCATAGCG GCCAGCATCG AAAACACTCG GTTCGACATT 
TTGCAGATGA TCGAACTGGG ACGTGAGCTT ATCGACTATC CCGATGAAAT CCATCACGGG
CCGGTGGTAT CCATGGGGGT GCCGAAGGAT ACGCTGCCCA AATGGGACGA TATCCAGATT
CTGACAGCGC AGCTCCATAA AAAACCATTA ATAGATGATG CCCCGGTGGA TACTCAGCTG
GTGATCGGTC CGCGTGCCCA AAAACCCCTG GTTCTGGATA TCCCGCTGCT GGTTGGCGAT
ATGAGTTATG GCGCGCTGTC GAAAAGGGCC AAGCAGGCAT TGTCAAAGGG GGCTGATCTG
GCCGGCACTG CGATCTGTTC AGGTGAGGGC GGCATTCTGG GGGATGAACT CGAGCTGAAT
CAAAGTTATA TGTTCGAATT GGGCAGCGCG CGCAACGGCC TGAAGAAAAA CAGCGACTTG
AGTCAGTTTT CGAAGGATTT TAAAGGCAAA GTCAAGGCGT TTCACTTCAA GGGCGGGCAG
GCGGCCAAGA CCGGTACAGG TGGGCATTTG CCGGGAGGCA AGGTTACGGA GGAGATTGCC
AAGGCACGCC AGATTGAAGT CGGCAAGGAT GCGATCTCCC CATCCACCTT TGCCGATTTC
CACACACCCC GCGATTTTGC TGATTTTGCC GATCAGATCC GCGACCAGAT GGGCGGCATT
CCGATTGGTT TCAAGATAAG CGCAAACCAC ATCGAGGATG ACATGCGCTT CGCGCTCGAT
GCCGGGGCCG ATTACATCAT TCTCGATGGA CGAGGTGGCA GTACCGGTGC TGCGCCGGGG
ATTTTCCGGG ACCATATTTC AGTGCCAACG ATCGCTGCAT TGGCGCGGGC GAGAAAATTT
CTCGACGTCG CAGGCCATGA GAAAGGTGCA AAGAATAGCG TGACCCTGAT TATTACCGGC
GGTTTGCGCA TTCCTTCCGA TTTCATTAAG GCGTTAGCCT TGGGAGCCGA CGGAATCGCG
CTGGCCAATA GTGCGCTGCA GGCAATTGGC TGTACCGGTG CGCGCATCTG CTACTCCGGC
GATTGTCCTG CCGGCATAGC AACCCATGCC CAACACTTGG TCAACAAGAT CGACGTCGAC
GCCAGGGCGC GTGACCTGGC GCTGTTTTTC AACGGCAGTG TACACCTGAT GAAGATCATG
ACAAGAGCCT GCGGCCACCA TTCCCTGCGG GAATTTAGCC GGGATGATAT TTGCACCTGG
AAAGCCGAAA TATCGAGGTT GGCTGGAATT CCCTATGCCG GTGTCGATCC CTCAAATCCC
TGA
 
Protein sequence
MNEELKKSIA ASIENTRFDI LQMIELGREL IDYPDEIHHG PVVSMGVPKD TLPKWDDIQI 
LTAQLHKKPL IDDAPVDTQL VIGPRAQKPL VLDIPLLVGD MSYGALSKRA KQALSKGADL
AGTAICSGEG GILGDELELN QSYMFELGSA RNGLKKNSDL SQFSKDFKGK VKAFHFKGGQ
AAKTGTGGHL PGGKVTEEIA KARQIEVGKD AISPSTFADF HTPRDFADFA DQIRDQMGGI
PIGFKISANH IEDDMRFALD AGADYIILDG RGGSTGAAPG IFRDHISVPT IAALARARKF
LDVAGHEKGA KNSVTLIITG GLRIPSDFIK ALALGADGIA LANSALQAIG CTGARICYSG
DCPAGIATHA QHLVNKIDVD ARARDLALFF NGSVHLMKIM TRACGHHSLR EFSRDDICTW
KAEISRLAGI PYAGVDPSNP