Gene Ndas_4410 details


Gene Information

Locus tag: Ndas_4410
Symbol:
ID: 9248285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5247216
End bp: 5248505
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: citrate synthase I
Protein accession: YP_003682305
Protein GI: 297563331
COG category:
COG ID:
TIGRFAM ID:
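
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 5247216
END_BP = 5248505
GENE_LENGTH_BP = 1290
PROTEIN_LENGTH_AA = 429


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1290
print(expected_protein_length(GENE_LENGTH_BP))  # 429
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields.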


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAG ACGACAAAGA GCGCACGGTG GAGCTCCACT ACGAGGGCGG CGTGCTGAAG 
CTGCCGGTGT TCACCGGTAC CGAGGGTGAG CGCACCATCG AGCTGAAGAC CCTGCTGGGG
TCGACGGGGA TGACGACCCT GGACCCGGGG TTCGGCAACA CCGCTTCGTG TAGCTCGGAG
ATCACCTACA TCGACGGGGC CGCCGGGATC CTGCGCTATC GCGGGTACCC GATCGAGGAC
CTCGCCGTGG GCGCGACCTT CATCGAGGTC GCCTACCTGG TGATCTACGG CGAGCTGCCC
GACGCGGACC AGCTCAAGGA GTTCGCCGAC AAGCTGCGCG ACAACGCCGA CATCCCGGCC
GACATGGGCG CGCTCATCGA CGCGATGCCC CGCAACGGCC ACCCCATGTC CCTCATGGCC
AGCGCCGTGA ACACCCTCGC CGCCTACTAC GACGACAGCG TCGACCCCGG CGACGAGGAC
CAGGTGGAGC TCGCCACGAT CCGGCTGTTG GCCAAGCTGC CGACGATCGC GGCCCGCATC
TACCGCAACT CGATCGGCGA GAAGCCGATC GGCCCGGACA GCTCGCTCGA CTACGTCGAC
AACTTCATCC GGATGACCTT CGGCGACGTC CACGCCGACA GCGAGCTGGG CGACCTGTTC
AACCAGGCCG TCGGTATGCT GCTGGTGCTG CACGCCGACC ACGAGCTCAA CTGCTCCACC
GCCACCGTAC GCGTCGTCGG CTCCTCGAAG GCCGACATCT ACGCCAGCGT GGCCTCGGGC
ATCAACGCCC TCTCCGGCCC GTCCCACGGC GGCGCCAACC AGGCCGTCCT GGAGATGCTG
GAGGAGATCC GCGACTCGGG CATCTCCATC GAGGACTTCC TGGAGAAGGT CAAGGCCCGC
GAGATGCGCC TCATGGGCTT CGGGCACCGG GTCTACAAGA ACTTCGACCC GCGCAGCAAG
GAGATCAAGG TTCTGGCCAG CCAGATCCTC GACCGCGACG AGAACCCGGA CGAGCTGTTC
GCGCTCGCCC TCAAGCTCGA GGCCGCCGCG CTCGCCGACT CCTACTTCAC CGAGCGCAAG
CTCTACCCGA ACGTCGACTT CTACACCGGC GTCATCTACC GCACCATGGG CTTCCCGACC
AACATGTTCA CCGTGTTGTT CGCCATCGGC CGCCTGCCCG GCTGGATCGC CCACTACCGC
GAGCAGCTGC GCGACCCGGC CTTCCGGATC GCGCGCCCGC GCCAGCACTA CGTCGGCTCC
GCCGAGCGCC GGCTCCCCGG CCAGGGGTAA
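
The gene sequence is printed in blocks of ten bases, so it has to be stripped of whitespace before any computation such as the GC content field. A minimal sketch follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed.
first_line = "ATGAGCGAAG ACGACAAAGA GCGCACGGTG GAGCTCCACT ACGAGGGCGG CGTGCTGAAG"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this fragment only
```

Passing the full 1290 bp sequence to the same functions gives a value that can be compared with the GC content field of the record.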
 
Protein sequence
MSEDDKERTV ELHYEGGVLK LPVFTGTEGE RTIELKTLLG STGMTTLDPG FGNTASCSSE 
ITYIDGAAGI LRYRGYPIED LAVGATFIEV AYLVIYGELP DADQLKEFAD KLRDNADIPA
DMGALIDAMP RNGHPMSLMA SAVNTLAAYY DDSVDPGDED QVELATIRLL AKLPTIAARI
YRNSIGEKPI GPDSSLDYVD NFIRMTFGDV HADSELGDLF NQAVGMLLVL HADHELNCST
ATVRVVGSSK ADIYASVASG INALSGPSHG GANQAVLEML EEIRDSGISI EDFLEKVKAR
EMRLMGFGHR VYKNFDPRSK EIKVLASQIL DRDENPDELF ALALKLEAAA LADSYFTERK
LYPNVDFYTG VIYRTMGFPT NMFTVLFAIG RLPGWIAHYR EQLRDPAFRI ARPRQHYVGS
AERRLPGQG
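
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon-to-amino-acid mapping in the compact NCBI layout (bases ordered T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code and differs only in which start codons are permitted. The function name `translate` is illustrative, and only the first and last lines of the gene sequence are translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAGCGAAG ACGACAAAGA GCGCACGGTG GAGCTCCACT ACGAGGGCGG CGTGCTGAAG"
last_line = "GCCGAGCGCC GGCTCCCCGG CCAGGGGTAA"

print(translate(first_line))  # MSEDDKERTVELHYEGGVLK
print(translate(last_line))   # AERRLPGQG
```

Both fragments match the corresponding ends of the listed protein sequence, and the final TAA codon accounts for the one-codon difference between the 1290 bp gene and the 429 aa protein.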