Gene Ndas_4196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4196 
Symbol 
ID9248070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5010647 
End bp5012389 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content75% 
IMG OID 
ProductABC transporter transmembrane region 
Protein accessionYP_003682095 
Protein GI297563121 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00643785 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGACAC TTCCCCAGCC CACCGGCGTT CCCGAACGCC GTTCGGCCGA CCGGTTCCTG 
TGGTGGCTCG CCCGGAAGGA GTGCCGCTCA CTGCTGTGGG CCGGGACCAC CACGGGCGCC
TTCATGCTCA CCGGCGTGCT CATCTCCGCC GCGCTCGGCG CCGCCCTCGA CTCCGGGGTC
ACCCGAGGCG ACCCGGACGC CCTCCTCGGC TGGTTCGGCG TCCTCAGCGC CCTCGTCGCG
GTCTCCGTCG CACTCGTCCC CCTCTCCCAC CGGGCCGACT GCTTCAGCTG GTACGCGTCC
GCCTACCGCA CGATCCAGGT GGTCACCAAC CATTCCGTGC GCATGGGCCC CACGCTCACC
CGGCGTCTGC CCTCCGGCGA GGTGGTGGCC GTGGGCACCG AGGACATCGA CCGCGTCGGC
GACTTCTACG AGACGGTGGG CCAGCTCCTG GCCGTCTTCG TGACGGTGGC CGCGGTCAGC
GCCCTCATGC TCACCTCGCA CGTCGCCTTC GGCGTCATCG TCCTGGTGTC GGTCCCGCTC
ATCATGGCGG GGATGGTCCC CCTGCTGAAC CTGTTCTCCC GCCGCCAGGA GCACCAGCGC
GACCGGCAGG CCGAACTCAC CACCCTGGCC ACCGACCTGG TCGCGGGCCT GCGCGTGCTC
CGGGGCGTGG GCGGCGAGCG CACCGTGGAC CACCGCTACC GCACCGCCTC CCAGGGGGTC
CGCTCCGCCG CCATGCGCGT GGGCTGGGTG GACGCCGCCC TCAACGCCGT GCGCGAGCTC
CTCCCGGGCC TCCTCCTCGT GGGCATCGTC TGGTACGGCG CGCGCCTGGC CCTGACCGGG
GAGATCACCG TCGGCCAGCT CGTCGCGTTC TACGGCTACG CCGGAACGCT CTCCATGGCC
GTGCGCCACC TGATGCGTTC CCTGTCCATG TACGTCTCGG CCCGGGTGGC CGCCGGTCGC
GTGGTGCGCG TGCTGCGCCT GGAGCACGAC CACCCCGACC CCGAGCGCCC CGAGCCGGAA
CCGGCCGGCC CCTGCGACCT GCACGACCCG GGCAGCGGAG TGACCCTCGC CGCGCACCGC
ACCACCGGCG TGGTCTGCGC CGACCCCGCC GACGCCACCG CCATCACCGA ACGCCTGGGG
CGCTACCGGG ACGAGGAGGG CGCGCACGCC GCACTGGTCG AGACGGCCAC GGGCCGGACC
GTGCGCCTGC GCGACCTGCC CCGGTCCCGG GTGCGCCGCC GCGTCCTGGT GGCCGCCAAC
GACGCGCATC TGTTCTCCGG AACCCTGCGC GGTGAACTGT CCCCGGACCA GGGGATCTCC
GACGAACGCC TGGCCGAGGT CGTGCGCGCC GCGCACGCCG ACGACGTGGT GCGCCAGTCC
GCCGCCGGAC TGGACGCGCT GCTGACCGAG CGCGGCCGCG AGTACTCCGG CGGCCAGCAG
CAGCGCCTGC GCCTGGCCCG CGCCCTGGCC CTGGAGCCGG AGATCCTGAT CCTGGTGGAG
CCCGCCTCCG CCGTGGACGC CCACTCCGAG GCCGCGATCG CCGCCGGGCT GCCCGCCGAG
CGGGCGGGGC GCACGACCGG CCTGGTCACC ACCAGCCCCC TGCTGCTCGA CCACACCGAC
CACGTGCAGT TCGTCGAGGA GGGCCGCCTC ACGGCCGAGG GGACCCACCG CGCCCTCCTC
GCCGAGTGCC CGAACTACGC CGCCACCGTC CTGCGCGCCG TTCCCACGGA GGGGAAGCCA
TGA
 
Protein sequence
MRTLPQPTGV PERRSADRFL WWLARKECRS LLWAGTTTGA FMLTGVLISA ALGAALDSGV 
TRGDPDALLG WFGVLSALVA VSVALVPLSH RADCFSWYAS AYRTIQVVTN HSVRMGPTLT
RRLPSGEVVA VGTEDIDRVG DFYETVGQLL AVFVTVAAVS ALMLTSHVAF GVIVLVSVPL
IMAGMVPLLN LFSRRQEHQR DRQAELTTLA TDLVAGLRVL RGVGGERTVD HRYRTASQGV
RSAAMRVGWV DAALNAVREL LPGLLLVGIV WYGARLALTG EITVGQLVAF YGYAGTLSMA
VRHLMRSLSM YVSARVAAGR VVRVLRLEHD HPDPERPEPE PAGPCDLHDP GSGVTLAAHR
TTGVVCADPA DATAITERLG RYRDEEGAHA ALVETATGRT VRLRDLPRSR VRRRVLVAAN
DAHLFSGTLR GELSPDQGIS DERLAEVVRA AHADDVVRQS AAGLDALLTE RGREYSGGQQ
QRLRLARALA LEPEILILVE PASAVDAHSE AAIAAGLPAE RAGRTTGLVT TSPLLLDHTD
HVQFVEEGRL TAEGTHRALL AECPNYAATV LRAVPTEGKP