Gene Ndas_5062 details

Gene Information

Locus tag: Ndas_5062
Symbol:
ID: 9248951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 203742
End bp: 204791
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: NADH-ubiquinone/plastoquinone oxidoreductase chain 6
Protein accession: YP_003682949
Protein GI: 297563976
COG category:
COG ID:
TIGRFAM ID:
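
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 203742
END_BP = 204791
GENE_LENGTH_BP = 1050
PROTEIN_LENGTH_AA = 349


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 204791 - 203742 + 1 gives 1050 bp, and 1050 / 3 - 1 gives 349 aa, which agrees with the table.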


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAACCCGC TCACCAGCGT CCACGCCGCC CCGCCGTCCG ACCAGCCGCT GTACCTGGCC 
GCGCCGATCG GCGGCGGCGA GGCGACCGCG TTCTGGATCC TGGGCGCCGT GGTCCTCCTC
GGCGCCCTGG GCGTGGTCTT CTCCCGCAAG GCCGTGCACT CGGCGATGTC CATGGCGCTG
ACCATGGTCG GCCTGGCGAT CTTCTACGGC ATCAACGCGG CGCCCTTCCT CATGGTCGTG
CAGATCGTCG TCTACACCGG CGCCGTCCTC ATGCTGTTCC TGTTCGTGCT GATGCTCATC
GGCGTCAGCT CGGCCGACTC CCTGGTGGAG ACGCTCAGCG GACAGCGCCT CATGACGGCC
GTCGTCGCCC TGTGCTTCGT CGGCGCCCTG ACCACGGGCA TCACCCGCAT CGCCGTGGGC
GACCCGGTCG GCCTCACCGA GGGCGCCGCG GCGGCCGGGG GCACCATCCC GTGGATCGCG
GGCGAGCTGC TGCTGCGCTA CGTGGTCGCC TTCGAGGCCA CCGGCGCGCT GCTCATCACC
GCGGTCCTGG GCGCGATGGT CCTGGCGCAC ACCGCCCGCA CCAAGAAGCG CCGCACCCAG
CGGGAGATCG CCGAGGACCG CATCCGCGGC GACCACCCGA CCCCGCTGCC GGGACCGGGC
ACCTACGCCC GGCACAACGC GATCGACATG CCCGCGCTGC TGCCGGACGG CACGGTCTCC
CAGCTCTCGC TCAACCCCGT CCTGGCCGCC CGCGCCCCCG AGTACCAGTC CCGGGTGCCC
GAGACCGTGC GCGCCGGCCA GGCCCCGCTG CCCGGCGGGG CCTCGGACTC GGCCGACTCC
AAGACGGCCG AGCCGGTCGC CGACGACCGG GCCCGGCCCG CCGCGGAGAG GCCCGGGAAC
CGGGTGGGGA ACGGAACGAG GGTCGCGGCC GAGGACGACG CCGACGAGGG CGGCGGCGGG
GCCGTGGGCC CCGACGGTCC CGACGACGCG GACGGGCCGC GCAACGGCAA CGGCAACGGG
CAGGAGGTCG AGTCCTCGTG GACCCGATGA
 
Protein sequence
MNPLTSVHAA PPSDQPLYLA APIGGGEATA FWILGAVVLL GALGVVFSRK AVHSAMSMAL 
TMVGLAIFYG INAAPFLMVV QIVVYTGAVL MLFLFVLMLI GVSSADSLVE TLSGQRLMTA
VVALCFVGAL TTGITRIAVG DPVGLTEGAA AAGGTIPWIA GELLLRYVVA FEATGALLIT
AVLGAMVLAH TARTKKRRTQ REIAEDRIRG DHPTPLPGPG TYARHNAIDM PALLPDGTVS
QLSLNPVLAA RAPEYQSRVP ETVRAGQAPL PGGASDSADS KTAEPVADDR ARPAAERPGN
RVGNGTRVAA EDDADEGGGG AVGPDGPDDA DGPRNGNGNG QEVESSWTR
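
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the tool that generated this record. It assumes the gene sequence is given 5' to 3' on the coding strand, uses the standard codon assignments shared by translation table 11, and reads an alternative start codon such as TTG as methionine. The names `translate_cds` and `gc_percent` are hypothetical helpers.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a permitted start codon as M and stopping at the first stop."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied from the record.
FIRST_ROW = "TTGAACCCGC TCACCAGCGT CCACGCCGCC CCGCCGTCCG ACCAGCCGCT GTACCTGGCC"

if __name__ == "__main__":
    print(translate_cds(FIRST_ROW))  # MNPLTSVHAAPPSDQPLYLA
    print(round(gc_percent(FIRST_ROW), 1))
```

The first row translates to the first 20 residues of the protein sequence, with the TTG start codon read as M. Passing the full gene sequence to the same functions would give the complete protein and a GC percentage to compare with the value in the Gene Information table.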