Gene Ndas_3133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3133 
Symbol 
ID9246989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3750713 
End bp3751912 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content68% 
IMG OID 
ProductRieske (2Fe-2S) iron-sulfur domain protein 
Protein accessionYP_003681048 
Protein GI297562074 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAGA ACAACAGCAA CGACAGCAGC GGGGGAGAGG CCACGCCCGA ACGCGTGGTC 
GGCACCCCGA AGACGAACGG GACGCACTCC GAGCGCGTGA GCGCGACCCC CACGACCGGC
GGGACCGAGC CCGAACGCGT GGTCGGCACC CCCAAGACCG CGAACCCGCT GGTCTCCGGT
GCCACCGAGG CCCCGGCGGA CGGGCACACG GGTCCGTACA AGGCCTCCGA GACGCACGCG
CGGCCGGAGG AGATGCAGCG CCGGGGTGAG AAGCTCGCCT CGGTGTGGTT CGTGATCGCC
TTCTTGGGCG GGATCGGCTT CCTGGTCGCC TACTTCCTCT TCAGCTCCAG TGCGATCGCG
GACCCGCAGA TCGCCCAGTA CTCGAACATG CTGCTGGGCG GCACGCTGAC GCTGGCGATG
TTCGGCATCG GTGCGGGTAT GACGGTGTGG GCGCGTCAGG TGATGCCGCA CTACGAGGTG
GCCTCTCCCT ACGACGAGCT GCCCTCGGAC GAGAAGGAGA AGGGTTCCTT CAGCGCCTTC
TTCATGCAGG GCGCCGACGA GAGCGGTTTC ACCAAGCGTC CGCTGATGCG CCGCACGCTC
ATCCTGGCGA TGCTGCCGTT GGGGCTGGCG CCGATCGTGC TGCTGCGCGA CACCGGCCCG
GTTCCGGGCG ACCTGATGAA GAAGACGATG TGGGAACACG GTCGCCGCAT CGTCGTGGAG
GGGAGCGGCC GTCCCCTCCG CCCCGAGGAC TTCGAGAACG ACCCCAACGC GATGGTCTCG
GCCCTGCCCG CCGCCGATGA CGACCACCAC CACCTCTCAC TGACCGACCA GGCCAAGACC
GTCATCATCC TGATCAAGAT CCCCGAGGAG GACTGGCAGG AGCGGATGAC CGAGGAGATC
GAGGGAACCG ACGGCCAGAC GCGTCTGAAC TGGACGCACA ACGGCATCGT CGCCTACTCC
AAGATCTGCA CCCACGTGGG CTGCCCCGCG GCCCTGTACG AACGCACGAC ACACCGCATC
CTGTGCCCGT GCCACCAGTC GACGTTCGAC GCCTCCAACG CCGCCGAGGT CGTGTTCGGT
CCGGCGCACC GGCCGCTGCC GCAGCTGCCG ATCGGCGTCG ACGACGAGGG CTACCTCATC
GCGACCGGCG ACTTCGACGA ACCCACGGGC CCCACGTTCT GGGAATACGC GAAGGACTAG
 
Protein sequence
MTENNSNDSS GGEATPERVV GTPKTNGTHS ERVSATPTTG GTEPERVVGT PKTANPLVSG 
ATEAPADGHT GPYKASETHA RPEEMQRRGE KLASVWFVIA FLGGIGFLVA YFLFSSSAIA
DPQIAQYSNM LLGGTLTLAM FGIGAGMTVW ARQVMPHYEV ASPYDELPSD EKEKGSFSAF
FMQGADESGF TKRPLMRRTL ILAMLPLGLA PIVLLRDTGP VPGDLMKKTM WEHGRRIVVE
GSGRPLRPED FENDPNAMVS ALPAADDDHH HLSLTDQAKT VIILIKIPEE DWQERMTEEI
EGTDGQTRLN WTHNGIVAYS KICTHVGCPA ALYERTTHRI LCPCHQSTFD ASNAAEVVFG
PAHRPLPQLP IGVDDEGYLI ATGDFDEPTG PTFWEYAKD