Gene Ndas_4197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4197 
Symbol 
ID9248071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5012386 
End bp5014326 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content75% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003682096 
Protein GI297563122 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.329748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGG CCGCCGACAA CGCCGAGACC CTCGGCCCGC CGCAGGCGTC CGGAGTGTTC 
GAAACGCCCC GAGCGCCCGA GGCGGCCCGC GCACCCGGAG CGCCCGCCGA TCCCGCCGCG
TCCGCCGCTC CCGCGGAGCA CCGCGCGGCG GGACTGCCGA TCGCGCCGGT CGCTGCCGTG
TGGCGGCGGC TGCGCCGTGC CGGGCGCGAG CACGGACGGC TGCTCGCCGT CGTCGTCCTG
CTGTACGGGA CGGCCGCGCT CACGGCGCTG GCCTCGCCCT GGATCCTGGG TCTGATCATC
GACACCGTGC GCGCCGAGGG GGCGCCGTCC GCGGCCTCGG AGCAGGCGGC CTCGCGCGTC
GACGTGCTCG CCGGGCTCAT CGTCGCGGCG CTGGTGCTGC ACGCCGGGTT CACCCTGGCC
TCCGTGGCGG CCTCGATCCG GTTCGGCGAG AGCGTGCTGG CCGAGCTGCG CGAGGAGTTC
GTGCGCGCGG TGCTCCGGCT CCCGATGGGA GTGGTGGAGC GGGCGGGCAC GGGCGACCTC
GTGGCGCGCA CCGGCCGCGA CATCGGCCAC CTGAGCCACA CCGTGCGCGT TTCGGTGCCG
GTGATGGCGG TGAGCACGGT GACCCTGGTG GTCGTCACCA CGGCGCTCAT CGTCCTGCAC
CCTCTCCTGC TGCTGGCGTG GCTGCCGTCG GCGCCCGTGC TGTGGCTGTC CACGCGCTGG
TACGCGCGCA GGGCCCCGGA CGGGTACGTG CGCGAGCTGG GCACCTACTC GGAGCTGACC
CAGAGCGTGA CCGACACCGT GGAGGGCGCG CACACCATCG AGTCGCTGGG ACGCCAGGCC
CGGCGGATCG CGCTCAACGA ACAAAGGGTG GGGCGGGCCT ACGCGGCGGA GCGGTACACG
CTGTGGCTGC GCTGCGTCTG GTACCCGCCG CTGGAGTTCG GGTACATGTT CCCGATCGCG
CTGACCTTCC TGGTGGCTGG CCTGCTGTAC GCCGACGGAG CGCTGAGCCT GGGCGGGATC
GCCACCGCGG TCTTCCTCAG CAGGCAGATG GCGCGGCCCC TGGACCAGCT GCTGGACCAG
GTGGACAGCC TGATGATGGG CTTCACGAGC ATGCGCAGGC TGCTGGGCGT GGAGCTGGCC
GGGGAACCGG AGGGGAGGGC GCGCTCCGCG GCGGACGCGG CCGGGACCGC CCGGCCCGGC
GAGGTGCGGG TGGAGGACGT GCGCTTCGCC TACACCGACG CGGAGGTCCT GCACGGCGTC
GACCTGGTGC TGGCGCCCGG TGAACGCCTG GCCGTGGTGG GTCCGAGCGG CGCGGGCAAG
TCGACCCTGG GCAAGCTGAT CGCGGGCGTG CACCCGCCCA CCTCGGGCGC CGTCCGCGTG
AGCGGTGCGC CGGTGTCGGG CCTGCCTCCC GAGGAACGGC GCGCGCGGGC GATCCTGCTG
AGCCAGGAGA GCCACATGTT CCGGGGCACG ATCGCCGAGA ACCTGGCGTT GGCGCTGGAC
CGGCCCGAGG GCGCGGCCGA GGTGGACGAG GAGCGCCTGT GGGAGGCGCT GGCCGCCGTG
GACGCCGAGC CCTGGGTGCG CGCGCTGCCG GAGGGGCTGG GCACGCGGGT GGGGTCGGGC
CACGCCCCGC TGGACCCGGC GCACGTGCAG CAGCTGGCCC TGGCCCGCGT GGTGCTGGCC
GACCCCGACG TGCTGGTGCT GGACGAGGCC ACGTCCCTGA TGGACCCGCG TTCGGCCCGC
CACCTGGAAC GCTCGCTGGC GGGGGTGCTG TCGGGGCGCA CGGTGGTGGC CATCGCGCAC
CGGCTGCACA CCGCGCACGA CGCCGACCGG ATCGCGGTGG TGGAGGACGG CCGGATCAGC
GAGCTGGGCA GCCACGACGA GTTGCTGGCC CGCGGCGGCT CCTACGCCGA CCTGTGGCGG
GCCTGGCACG GCGAGGACTG A
 
Protein sequence
MTRAADNAET LGPPQASGVF ETPRAPEAAR APGAPADPAA SAAPAEHRAA GLPIAPVAAV 
WRRLRRAGRE HGRLLAVVVL LYGTAALTAL ASPWILGLII DTVRAEGAPS AASEQAASRV
DVLAGLIVAA LVLHAGFTLA SVAASIRFGE SVLAELREEF VRAVLRLPMG VVERAGTGDL
VARTGRDIGH LSHTVRVSVP VMAVSTVTLV VVTTALIVLH PLLLLAWLPS APVLWLSTRW
YARRAPDGYV RELGTYSELT QSVTDTVEGA HTIESLGRQA RRIALNEQRV GRAYAAERYT
LWLRCVWYPP LEFGYMFPIA LTFLVAGLLY ADGALSLGGI ATAVFLSRQM ARPLDQLLDQ
VDSLMMGFTS MRRLLGVELA GEPEGRARSA ADAAGTARPG EVRVEDVRFA YTDAEVLHGV
DLVLAPGERL AVVGPSGAGK STLGKLIAGV HPPTSGAVRV SGAPVSGLPP EERRARAILL
SQESHMFRGT IAENLALALD RPEGAAEVDE ERLWEALAAV DAEPWVRALP EGLGTRVGSG
HAPLDPAHVQ QLALARVVLA DPDVLVLDEA TSLMDPRSAR HLERSLAGVL SGRTVVAIAH
RLHTAHDADR IAVVEDGRIS ELGSHDELLA RGGSYADLWR AWHGED