Gene Nmul_A1909 details


Gene Information

Locus tag: Nmul_A1909
Symbol:
ID: 3784147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2199174
End bp: 2200502
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 57%
IMG OID: 637811995
Product: FolC bifunctional protein
Protein accession: YP_412596
Protein GI: 82703030
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
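
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither assumption is stated on this page. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(2199174, 2200502)
print(length_bp)                  # 1329
print(protein_length(length_bp))  # 442
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed for Nmul_A1909.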


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACTT TCTCCTTTGC TTCCGCAACG CTGCCAGAGT GGCTGGACTA CCTGGAACGC 
CTGCATCCCG CAGCTATCGA AATGGGACTG GAGCGGATAC GGCGCGTCCA GGCGGAACTG
GAGCTCGAAC CTTCATTTCC CATCATTTCC GTTGCGGGCA CCAACGGGAA GGGATCGACA
TGCATGATGC TGGAAGCCAT TCTGAGCCAT GCCGGATATC GGGTCGGATG CTATACCTCC
CCTCATTTGC TGCGATATAA CGAACGCGTG CGAATCGACC GGAAGGAGGC AAGCGATGAT
GAGCTGTGCG AGGCTTTCCG CGCTGTCGAA TCGGCGCGGA TGAATAGCGC CGTATCCCTG
ACCTATTTCG AGTTTGGCAC ACTGGCCGCC ATGTACTTGT TCAGTCAGGC CGAAGTGGAA
GTTGCAATTC TGGAGGTGGG ATTGGGCGGG CGGCTGGATG CGGTCAATGT GTTCGAGGCC
GATTGCGCGG TCCTTACCAG CGTGGATTTC GATCATATGG ATTATCTGGG CAATACGCGT
GAACAGATCG GATTCGAAAA GGCCGGAATT TTCAGGTCAG GGAAGGCCGC AATCTGTTCC
GAACCGGACC TGCCCATAAG CGTGCGTCGT CATGCGGAAT CGATCGGCGC CGACCTCATG
CATATCGGAG AGCACTTCGG TTATTCAACC GCTCCACAGT CCTGGAGCTA TTGGCGGAAC
GGCGAGAGCA GGCATGCACT CCCTTATCCG GCCTTGCGCG GCGCCTATCA GTTGAAAAAT
GCCAGTGCGT GCCTCGCCGC CCTGGATTCC CTGAAAGATA CATTGCCGGT TACCTTGAGC
GACATTCGTC ATGGTTTGCT GGAAGTGGTT TGGCCAGCCC GGTTTCAGGT GTTGCCTGGA
CAGCCGGTCA GGGTGCTCGA TGTTGCCCAT AATCCAGGCG CGGCACGCGC ATTGGCCGCC
AGCCTCGATA GCATGGGGCG TTATCCCAGA ACGTACGCGG TATTTGCAAT GCTCGGGGAT
AAAGATATCG CAGGCGTGGT ACGGGAGTTG AGGTCCAGCG TGGATGTCTG GCTGGTATCG
GGTATTGACG CTCCAAGAGG CGCTACGGCG GACGAAGCCG CCACGCAGGT TGCCCAGGCG
CTGCAAATCG CCGAGCCGCT CCCGGGGAAT GCAGGAGAAG GCGCCGCCCA TACCATCCGC
AAGTTCCGCA ATCCATCCGA GGCATATGCT TACGCCTGTG AGCAGGCAGC CAGAAATGAT
AGAATTTGTG TTTTCGGCTC ATTCCATACC GTGGCTGAAG TGTTAAGGAA CAGAATTGAG
CGCGGGTAG
 
Protein sequence
MSTFSFASAT LPEWLDYLER LHPAAIEMGL ERIRRVQAEL ELEPSFPIIS VAGTNGKGST 
CMMLEAILSH AGYRVGCYTS PHLLRYNERV RIDRKEASDD ELCEAFRAVE SARMNSAVSL
TYFEFGTLAA MYLFSQAEVE VAILEVGLGG RLDAVNVFEA DCAVLTSVDF DHMDYLGNTR
EQIGFEKAGI FRSGKAAICS EPDLPISVRR HAESIGADLM HIGEHFGYST APQSWSYWRN
GESRHALPYP ALRGAYQLKN ASACLAALDS LKDTLPVTLS DIRHGLLEVV WPARFQVLPG
QPVRVLDVAH NPGAARALAA SLDSMGRYPR TYAVFAMLGD KDIAGVVREL RSSVDVWLVS
GIDAPRGATA DEAATQVAQA LQIAEPLPGN AGEGAAHTIR KFRNPSEAYA YACEQAARND
RICVFGSFHT VAEVLRNRIE RG
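
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative only: it strips the 10-character block spacing used on this page, translates with the amino acid assignments of the standard genetic code (which, to the best of our knowledge, translation table 11 shares, differing only in its permitted start codons), and computes a GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base cycles fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence shown above.
first_row = "ATGAGCACTT TCTCCTTTGC TTCCGCAACG CTGCCAGAGT GGCTGGACTA CCTGGAACGC"
print(translate(first_row))  # MSTFSFASATLPEWLDYLER
```

The translated first row matches the first 20 residues of the protein sequence, and the final nine bases (CGCGGGTAG) translate to RG followed by a stop, consistent with the protein ending in RG. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.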