Gene Ndas_5564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5564 
Symbol 
ID9249467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp764528 
End bp765568 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content68% 
IMG OID 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_003683449 
Protein GI297564476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.271716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.315175 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGAGC GGACCATGAC CAAGGGCGAA CCGCCCTCCC CGGCCGACGC GTCCCGCGGG 
CGCGGGGGCC GGGGGCGGTC CCCGGCCGTC CCGGACCGCC GGGGCGGGTC CGCGTCCCCG
AGCCGCAGGC GGGTCCCGTT CGGTGCGAGG CTGCGCCGCG ACTGGCAGCT GCTGCTGATG
ACGGTCCCGG CGATCGGCCT GCTGGCGGTG TTCCACTACA CGCCGACCCT CGGCAACATC
ATCGCCTTCC AGGACTACAA CCCCTGGGAC GGGGTGTGGG GCAGCCCGTG GGTGGGGCTG
GCGCACTTCG AGCGTCTGTT CACCGACCCC CGCTTCTGGT CCGCGGCGGG CAACACGCTG
GTCATCGCCG CCGTCCAATT GGTGTTCTTC TTCCCCATCC CGATCGCGCT GGCCATCCTG
CTGGACAGCG TCCTCAGCCC CAGGCTGCGG ATGGTGCTCC AGAGCATCGT GTACATGCCG
CACTTCTTCT CGTGGGTCCT GGTCGTCACC CTGTTCCAGC AGATCCTGGG CGGCGCCGGA
CTGTTCTCGC AGATCCTGCG GCAGAACGGG TACGCGCCGC TGGAGGTCAT GTCCGATCCC
GACGCGTTCC TGTTCGTGGT CACCTCCCAG ATGGTCTGGA AGGACGCCGG GTGGGGCACG
ATCATCTTCC TGGCGGCGCT GGCGGCCGTG AACCAGAACC TCTACGAGTC CGCGGCCGTG
GACGGCGCCG GGCGATGGCG GCGGATGTGG CACATCACCC TGCCGGGCCT GCGCCCGGTG
ATCGTCCTGC TGCTCATCCT CAAGATCGGC GACATCCTCA ACGTCGGCTT CGAGCAGTTC
TACCTCCAGC GCGACGCGTT CGGATCGGGC GTGTCGGAGG TGCTGGACAC CTTCATCTAC
CACCAGACCC TGGTGACGGG GAACTTCAGC GCGGGAGCGG TCGCGGGCCT GGTCAAGGGC
GTGGTCGGAC TGGTCCTCAT CGTTCTGGCC AACAAGCTGG CCCACAAGAT GGGTGAGAAC
GGAATCTACC GACGAGCATG A
 
Protein sequence
MTERTMTKGE PPSPADASRG RGGRGRSPAV PDRRGGSASP SRRRVPFGAR LRRDWQLLLM 
TVPAIGLLAV FHYTPTLGNI IAFQDYNPWD GVWGSPWVGL AHFERLFTDP RFWSAAGNTL
VIAAVQLVFF FPIPIALAIL LDSVLSPRLR MVLQSIVYMP HFFSWVLVVT LFQQILGGAG
LFSQILRQNG YAPLEVMSDP DAFLFVVTSQ MVWKDAGWGT IIFLAALAAV NQNLYESAAV
DGAGRWRRMW HITLPGLRPV IVLLLILKIG DILNVGFEQF YLQRDAFGSG VSEVLDTFIY
HQTLVTGNFS AGAVAGLVKG VVGLVLIVLA NKLAHKMGEN GIYRRA