Gene Ndas_4769 details


Gene Information

Locus tag: Ndas_4769
Symbol:
ID: 9248652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5659722
End bp: 5660789
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: amino acid-binding ACT domain protein
Protein accession: YP_003682659
Protein GI: 297563685
COG category:
COG ID:
TIGRFAM ID:
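
The coordinate and length fields above are consistent with one another. The minimal sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(5659722, 5660789)
print(length_bp)                  # 1068
print(protein_length(length_bp))  # 355
```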


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTTT CCGAAGGACA GCACGGAACC GACGACCACC GCCACGGCTT CTTCGGCCGC 
GAGGCACTCG ACCTGGGCAC CCTGCTCCTC GCCGCCGGTG CGGCGCACCT GGTGGTGCTC
TCCCTCGGGC ACAGCGACGC GGGCGTCCGC GTCCTGATCA CCGTGGGACT GCTGCTGCTC
GCGGTCTCCG CGGTCCACCG GTGGCGCCGC CACAAGGCCG CGTCCGCTCC CCGGCCGCCC
AGGGGCTCCG GAGCGGTGAA CGCGTCCGGG AGCGCCGGGC CGCCCACGGG TACCGGCCCG
TACGCGAACG GCGGGCTGAC CGGCGGCGAC GGTGCGTCCC GTGGCACCGG GACATCCGAG
GACACGGCGC TGCCCGGGAG CGCCGGGCTG CCCGGGAACG CCGGGTCGGC CGGCGGCGGA
TCCTTCCGAA GCGCCCCCTC GACCGGGGAG GGAGCCCCGG CGTCGGCCGG TTCCCCGGCG
GGTGGGGAAC CGGTGCGCGC CCCGTCCGAC GACCTGCTGT GGAGCGTCCG CGCGACGGTC
GCCGACGTGC CCGGCGGCCT GGCCGCGCTC ACCGCGCGGT TCGCCGCCCT CGGGATCGAC
ATCCGGCTCA TGCAGGTGCA CCCGGCGGGG CCGGACGCCG TGGACGAGTT CTTCGTCAGC
GCTCCCGCGC ACGTGGGAGA GGGCGACCTG TACACCGCCG TACGGGAGGC GGGCGGACGC
GAGGCCGCCG TGCGCCGCGC CGACGTCCAC GAGCTCAGCG ACACCACCAG CCGCACGCTC
GCCCTGGTCA GCGCCCTGGT CACCGGGGCG ACCACGCTGG AGCGCTCGCT GCTCTCCCTG
GCCTCGGCGC GGGCCGTGGA GCACACCGCC GAACCGCCCG CCGGAACGGT CCGCGAGGAC
CTGTCCGGCA CGGTGATGAC CCTTCCGGCA CCCGACGGCG GCGTCCTGAC CGTCCGCCGG
GAGGTCATCC CCTTCACCGC CGTGGAGTTC GCCCGGTGCC GGGCCCTGGC CCACGTCGCC
TCGTCCCTGC ACGCGCGTTC GCACGGCCCG GGACCCGGGA GGCGCTGA
 
Protein sequence
MDVSEGQHGT DDHRHGFFGR EALDLGTLLL AAGAAHLVVL SLGHSDAGVR VLITVGLLLL 
AVSAVHRWRR HKAASAPRPP RGSGAVNASG SAGPPTGTGP YANGGLTGGD GASRGTGTSE
DTALPGSAGL PGNAGSAGGG SFRSAPSTGE GAPASAGSPA GGEPVRAPSD DLLWSVRATV
ADVPGGLAAL TARFAALGID IRLMQVHPAG PDAVDEFFVS APAHVGEGDL YTAVREAGGR
EAAVRRADVH ELSDTTSRTL ALVSALVTGA TTLERSLLSL ASARAVEHTA EPPAGTVRED
LSGTVMTLPA PDGGVLTVRR EVIPFTAVEF ARCRALAHVA SSLHARSHGP GPGRR
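
The protein sequence can be rederived from the gene sequence, and the GC content recomputed, with a few lines of Python. This is a minimal sketch rather than the method used to build this record. It assumes the sequence is pasted as plain text (whitespace is stripped), that translation starts at the first base, and that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
prefix = "ATGGACGTTT CCGAAGGACA GCACGGAACC"
print(translate(prefix))  # MDVSEGQHGT
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the protein sequence and the GC content field in this record.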