Gene Ndas_2921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2921 
Symbol 
ID9246773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3492867 
End bp3494078 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content73% 
IMG OID 
Productacyl-CoA dehydrogenase domain protein 
Protein accessionYP_003680837 
Protein GI297561863 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.296233 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTCG CCCACGACAG CGAGACCGAG GAACTGCGCG AACGGCTGCT CGCCTTCATG 
GACGAGCGCG TCTACCCGGC CGAGGCGGTG CTGGAGGAGC AGCTGGCCGT GCGCGAGGAC
CGCTGGTCCA CCGCGCCGGT GGTGCGCGAG CTCCAGGCCG AGGCGCGCGA GCGCGGGCTG
TGGAACCTGT TCCTGGCGGG CCACCCCGAA CACGGCGGCC TGCCCAACCT GCGGTACGCG
CCGCTGGCGG AGATCACCGG GCGCAGTCCG CGCCTGGCGC CGATCGCGCT CAACTGCGCC
GCGCCCGACA CCGGGAACAT GGAGGTGCTG ACGATGTTCG GCACCCCCGA GCAGCGCGAG
CGCTGGTTGG AGCCCCTGCT CGACGCCCGG ATCCGGTCGG CGTTCGCGAT GACCGAGCCC
GCGGTGGCCT CCTCCGACGC CACCAACATC ACCACCAGCA TCGTCCGCGA CGGCGACGAG
TACGTGGTCA ACGGACGCAA GTGGTACATC ACCGGGGCCC TCAACCCCGA GTGCGCCGTG
TTCATCGTGA TGGGCAAGAC CGACCCCGAA GCCGAGCGGC ACAGGCAGCA GAGCATGGTG
CTGGTGCCGC GCGACACCCC CGGGCTGACG GTCAGGCGGG GCATGACGGT GTACGGCTAC
GACGACAGCG ACCACGGCGG CCACGCCGAG GTCGTGTTCG AGGACGTGCG GGTGCCCGCC
TCGAACCTGG TCGGCGGCGA GGGCGAGGGG TTCGCCATCG CCCAGGCCCG GCTGGGGCCG
GGCCGCATCC ACCACTGCAT GCGCGGCATC GGCATGGCCG AGCGCGCCCT GGAGCTCACC
TGCCGCCGGG TGCTGGACCG CGTGGCCTTC GGGAGGCCGC TGGCCGAACA GGGCGTGGTC
CGCGAGTGGA TCGCCGAGGC GCGCGTGGCC ATCGAGCAGC TGCGGCTGCT GGTGCTCAAG
ACCGCGTACC TGATGGACAC CGTGGGCAAC AGGGGCGCGC ACACCGAGAT CCAGGCGATC
AAGATCGCCA CCCCGCGCAC GGTGGAGTGG ATCCTGGACA AGGCGATCCA GGCGCACGGC
GCGGCCGGGG TCAGCCAGGA CCTGCCCCTG GCCGGATGGC TGGCGGGGGT GCGGTCGCTG
CGGCTGGCCG ACGGGCCCGA CGAGGTGCAC CTGCGCTCGC TGGGCCGCGC GGAGCTGCGC
AAGTACCGCT AG
 
Protein sequence
MDFAHDSETE ELRERLLAFM DERVYPAEAV LEEQLAVRED RWSTAPVVRE LQAEARERGL 
WNLFLAGHPE HGGLPNLRYA PLAEITGRSP RLAPIALNCA APDTGNMEVL TMFGTPEQRE
RWLEPLLDAR IRSAFAMTEP AVASSDATNI TTSIVRDGDE YVVNGRKWYI TGALNPECAV
FIVMGKTDPE AERHRQQSMV LVPRDTPGLT VRRGMTVYGY DDSDHGGHAE VVFEDVRVPA
SNLVGGEGEG FAIAQARLGP GRIHHCMRGI GMAERALELT CRRVLDRVAF GRPLAEQGVV
REWIAEARVA IEQLRLLVLK TAYLMDTVGN RGAHTEIQAI KIATPRTVEW ILDKAIQAHG
AAGVSQDLPL AGWLAGVRSL RLADGPDEVH LRSLGRAELR KYR