Gene Ndas_0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0029 
Symbol 
ID9243856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp37254 
End bp38774 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAbgT putative transporter 
Protein accessionYP_003677987 
Protein GI297559013 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.390829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCTGC TCCTGGGCTC CCTCCAGGTG ATCGAGCGGG TGGGCAACAA GCTGCCCCAC 
CCGTTCTGGC TGTTCACGAT CATGGCCGGG ATCGTCATCG CGCTCAGCGC CCTGCTCAAC
GCGCTGGGCG TCTCCGCGGT CTCACCGGTG GACGGCGAGC GCATCGCGGT CCGCAGCCTG
CTCTCGCCCG AGGGCGTGCA GACCATCGTC GGCGACGCGA TCGACAACTT CGCGACCTTC
CCGCCGCTGG CGATCATCAT CGTCGTCATG CTCGGGGTCT CGGTCGCCGA GCGCAGCGGC
CTGCTCAACG CGATGCTGCG CGGCAGCGTC ACCAGGGTCC CGGCCCGGTG GCTGACCTTC
GCCGTGGCGC TGACCGGCAT CACCGGCAGC GTCGCCTCCG ACGCCGCCTA CGTCGTGCTC
ATCCCGCTGG GCGCCCTGCT CTTCAAGGCC GCGGGCCGCA GCCCCGTCCT CGGCCTGGTC
GTGGCCTTCG GGTCCGTCTC CGCCGGGTAC AACGCGTCAC TGCTCATCAC GCCCACCGAC
GCGCTCCTGG CCGCACTGAG CACCGAGGCC GCCTCGATCA TCGACCCCGG CTACACGGTC
ACGGCGCTGG ACAACTACTT CTTCAGCGTC GTCTCCGCCG TCGTGCTCGC CGCCCTGATC
ACCCTCGTCA CCGAGTTCCT GCTGAGCAGG GGCTCCGGGA GCCTGGCCGA GGACGGCGAC
GGCGAGGGGG ACCGGGAGGC ACAGGAGCTG GGCTCCATGA CCCTCTCCGA CGCCGAGCGC
CGGGGGCTGC GCAACGCCGG GCTGACCGTG GTGGCGGCCG TGGCGGTCCT CGCGGCGGCG
CTCGCGCCCC CCGCCTCGCC CCTGCGCGGG GAGGACGGGA CGGTCCTCGG CTCGCCGATC
ATCACCGGTG TCGCCTATGT GCTGGGCGTC CTGTTCCTGC TCGCCGGAGT CGTGTACGGG
CGCGCCACGG GCACCGTGGC CTCGGCGCGG GACGTCCCCG AGGCGATGAC CGCGGGGGTG
CGCGACCTGG CGCCGGTGGT CGTGCTGTTC TTCGCCGCCT CGCAGTTCCT CGCCTACTTC
CGCTGGACCG GGATCGGCGA GATCGTGGCC ATCCGCGGCG CCGCGCTGCT GGACTCGGCG
GGGGTCCACC CCCTGGTGCT CTTCCTCGGC ATGATCGTGT TCTCCTCGCT GCTGAACCTG
CTCATCACCA GCGGTTCGGC GCAGTGGACC CTCATCGCGC CGGTCTTCGT GCCCATGTTC
ATGCTGCTGG ACGTGCCGCC CGAGACGACC CAGGCCGTCT ACCGCATCGC CGACTCCAGC
ACCAACGTCA TCAGCCCGAT GAGCCCGTAC TTCGTCATGG CGCTGGGCTT CCTCCAGCGC
TACCGCCGGG ACGCGGGCAT CGGCACGCTG ATCTCCCTGA CGCTGCCGCT GTGCCTCACC
GTCCTGGTCG GCTGGACCCT GCTGTTCCTG GGCTGGTGGG CGCTCGACAT CCCGCTGGGA
CCGGGTGTCC CGGTGCGCTG A
 
Protein sequence
MRLLLGSLQV IERVGNKLPH PFWLFTIMAG IVIALSALLN ALGVSAVSPV DGERIAVRSL 
LSPEGVQTIV GDAIDNFATF PPLAIIIVVM LGVSVAERSG LLNAMLRGSV TRVPARWLTF
AVALTGITGS VASDAAYVVL IPLGALLFKA AGRSPVLGLV VAFGSVSAGY NASLLITPTD
ALLAALSTEA ASIIDPGYTV TALDNYFFSV VSAVVLAALI TLVTEFLLSR GSGSLAEDGD
GEGDREAQEL GSMTLSDAER RGLRNAGLTV VAAVAVLAAA LAPPASPLRG EDGTVLGSPI
ITGVAYVLGV LFLLAGVVYG RATGTVASAR DVPEAMTAGV RDLAPVVVLF FAASQFLAYF
RWTGIGEIVA IRGAALLDSA GVHPLVLFLG MIVFSSLLNL LITSGSAQWT LIAPVFVPMF
MLLDVPPETT QAVYRIADSS TNVISPMSPY FVMALGFLQR YRRDAGIGTL ISLTLPLCLT
VLVGWTLLFL GWWALDIPLG PGVPVR