Gene Ndas_3391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3391 
Symbol 
ID9247256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4053094 
End bp4054446 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content71% 
IMG OID 
ProductFolC bifunctional protein 
Protein accessionYP_003681302 
Protein GI297562328 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAAG CCAGCGCCGA GCGGCGCTAC GCCGAGGTGA CCGCGGAGAT CCTCGCCCGC 
GCGCCCGAGT CCGACATCGA TCCTCGACTG GACCGGGTCC GCACCCTGCT GGACCTCCTG
GGCGACCCGC ACCGCAACTT CCGGGCCATC CACGTCACCG GTACCAACGG CAAGACCTCC
ACCGCCCGCA TGATCGACGC GCTCATGCGC GGGCGGGGTC TGCGCGTGGG CCGCTACACC
AGCCCGCACC TGCGCACCGT GCGCGAGCGC ATCGTCATCG ACGGGGAGCC CATATCCCAG
GAGCGGTTCG TCGCGGCCTA CGACGACATC CGCCCCTACG TCGAGATGGC CGACTCGATG
AACGACGCAC CGCTGTCGTT CTTCGAGATC CTCACGGTGA TGGCCTACGC CGTCTTCGCC
GACGCCCCCG TGGACGTGGC CGTCGTCGAG GTCGGCATGG GCGGCCGGTG GGACGCCACC
AACGTCATCG ACGGAGACGT CGCCGTGGTG ACCCCCATCG GGATCGACCA CACCGAGTAC
CTGCCCGACA CGGTGGAGGG CATCGCCGAG GAGAAGGCGG GCATCATCAA GCCCGACTCC
GTGGCGGTCC TGGCCCAGCA GCCGCTGCCC GCCGCCGAGG CCCTGGTCCG CAACGCCGCG
GAGGTCGGGG CGCGGGTGGC CAGGGAGGGC CTGGAGTTCG GCGTCACCTC CCGCGAGATC
GCCGTCGGCG GCCAGCAGAT CGCCGTCAAG GGCCTCACCG GCAACTACGA GAACCTGTTC
CTGCCCCTGT TCGGGGCACA CCAGGCGGGG AACGCCGCCG TGGCCCTGGC CGCGGTCGAG
GCGTTCGCCT CCTCCGGCGA CGACGCCGGG GGCCTGGACC CCGCGATCGT CGCCGAGGCC
CTCGCGGGAG TGGACTCCCC GGGCCGTATG GAGGTCGTGC GCACCAGCCC GACCATCATC
GCCGACGCGG CGCACAACCC GGCCGGGATG ACCGCCACCG CGGCGGCCGT GGAGGAGGCC
TTCACCTTCT CCCGGCTGGT CGGGGTGGTC GCGATCATGG CCGACAAGGA CGTCGAGGGG
ATCCTCGAAC CCCTCGAACC ACTGCTCGAC GAGATCGTCG TCACCCGTAA CTCCTCCCCG
CGTTCCCTCG AACCGGAGCG GCTGTCCAAC GTCGCCCAGC ACATCTTCGG TGAGGAACGC
GTGCACGTGG AGCCCCGACT CGACGACGCC ATCGACCGGG CCGTGGGCCT GGCCGAGGAA
GGCGGGGAGT TCGGCGGCAC CGGTGTACTG GTCACCGGAT CGGTCGTCAC CGCCGGTGAC
GCCGTCCACC TGTTGCGCGG TGCGCAGGAG TGA
 
Protein sequence
MSEASAERRY AEVTAEILAR APESDIDPRL DRVRTLLDLL GDPHRNFRAI HVTGTNGKTS 
TARMIDALMR GRGLRVGRYT SPHLRTVRER IVIDGEPISQ ERFVAAYDDI RPYVEMADSM
NDAPLSFFEI LTVMAYAVFA DAPVDVAVVE VGMGGRWDAT NVIDGDVAVV TPIGIDHTEY
LPDTVEGIAE EKAGIIKPDS VAVLAQQPLP AAEALVRNAA EVGARVAREG LEFGVTSREI
AVGGQQIAVK GLTGNYENLF LPLFGAHQAG NAAVALAAVE AFASSGDDAG GLDPAIVAEA
LAGVDSPGRM EVVRTSPTII ADAAHNPAGM TATAAAVEEA FTFSRLVGVV AIMADKDVEG
ILEPLEPLLD EIVVTRNSSP RSLEPERLSN VAQHIFGEER VHVEPRLDDA IDRAVGLAEE
GGEFGGTGVL VTGSVVTAGD AVHLLRGAQE