Gene Ndas_0031 details

Gene Information

Locus tag: Ndas_0031
Symbol:
ID: 9243858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 40318
End bp: 42354
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: phenylacetic acid degradation protein paaN
Protein accession: YP_003677989
Protein GI: 297559015
COG category:
COG ID:
TIGRFAM ID:
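
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive and that the end position includes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(40318, 42354)
print(length)                     # 2037
print(protein_length_aa(length))  # 678
```

Both values match the Gene Length and Protein Length fields of the record.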


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.300256
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.45539
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGCCCGAAA CACTCCAGAG CTACGCGCGG GGAGCATGGT TCACCCCCGC GGACGAGGGC 
GCGCCCCTGG CCGACGCGAA CACCGGGGAG ACCGTCGCAC GGATCTCGTC CGACGGCCCC
GACGTCGCGG GGATGGTCGA CCACGCGCGG ACCGTCGGCG GCCCCGCGCT GCGCGCGCTC
ACCTTCCACC AGCGCGCGAA CATCCTCAAG GAGGTCGCCA AGCACCTGAC GGGGTACAAG
GAGGAGTTCT ACGCCCTCTC CCACCGGACC GGCGCGACCG CACGGGACAC CGCCGTGGAC
GTGGACGGCG GTTTCGGCAC GCTCTTCAGC TTCTCCAGCA AGGGCCGCCG CGAGCTGCCC
AACTCCACCG TCATCCTGGA CGGGCCCCTG GAGCCGCTCG GCAGGCAGGG CACGTTCGTG
GGCCAGCACG TGTACACGTC CCGACCGGGC GTGGCCGTGC AGATCAACGC CTTCAACTTC
CCGGTGTGGG GCATGCTGGA GAAGTTCGCC CCCGCCTTCC TCGCGGGGAT GCCCAGCATC
GTCAAGCCCG CCGGGCAGAC CGCCTACCTC ACCGTCGCGG TCGTGCGGCG CATGGTCGAG
TCGGGCCTGC TGCCCGAGGG CTCCCTCCAG CTCCTCGTCG GCAGCCACCG GGGACTGCTC
GACGCCCTGG GTCCGCAGGA CGTCGTGGGC TTCACGGGGT CCGCCGCCAC CGGCGCCATC
CTGCGCAACC ACCCCAACGT GGTCAGCGGA GGCGTGCAGC TCAACGTCGA GGCCGACTCC
CTCAACTGCT CGATCCTCGG CCCGGACGTG ACCGAGGAGG ACCCCGAGTT CGACCTGTAC
GTCAAGCAGG TCGTCACCGA GATGACCGTC AAGGCCGGGC AGAAGTGCAC CGCCATCCGC
CGCGTCATCG TGCCCGCGTC GATGGCGGAG GCCGTCACCG GGGCCCTGAC CGAACGCCTG
GCCAGGGTGG TGGTCGGCGC GGCCGACCAC CCGGACACGC GGATGGGCGC CCTGGTCTCG
CTCGACCAGC GCGAGGAGGT CCGCAAGGCG GTCAAGGCCC TGCGCGCCAC CAGCGAGCTC
GTCTACGGCG ACCCGGAGAG TGTGGAGGTC GCCGGGGCCG ACGACGAGTC GGGCGCGTTC
ATGTCCCCGA TCCTGCTGCG CGCCGAGCCC GGCGCCCGGG AGCCGCACGA GGTGGAGGCC
TTCGGCCCCG TCAGCACCGT GATCACCTAC GACGGCGTCG CCGAGGCCGT GGAGCTGGCC
GCCCGGGGCA GCGGCAGCCT GGTGGGGTCG CTCGTCACCC GCGACCCGGA CGTGGCGCGC
GAGGTCGTCC TGGGGCTGGC GCCCTGGCAC GGGCGCATCC TGGTGCTCAA CCGCGACGAC
GCCAAGGAGT CCACCGGCCA CGGCTCCCCG CTGCCCGTGC TGGTGCACGG CGGACCCGGC
CGCGCGGGCG GCGGCGAGGA GATGGGCGGC GTGCGCGGCG TCAAGCACCA CATGCAGCGC
ACCGCCGTGC AGGCCCCGCC GGACATGGTC ACCGCGATCA CCGGCCACTG GACCACCGGC
TCCGAGCGGA CCGTGGGCGA CGTGCACCCG TTCCGCAAGG ACCTGTCCCA GCTGCGGATC
GGCGACACGA TCCGGTCGGC GGAGCGCACC GTCACCCGGG CCGACATCGA CCACTTCGCC
GAGTTCACCG GCGACACGTT CTACGCGCAC ACCGACGAGG AGGCCGCCGC CGCCAACCCG
CTGTTCGGCG GGATCGTGGC GCACGGCTAC CTGGTGGTGT CACTGGCGGC GGGCCTGTTC
GTGGACCCGG CCCCGGGCCC GGTGCTCGCC AACTTCGGCG TGGACAACCT GCGCTTCCTC
ACCCCGGTGA AGGAGGACGC CACCATCCGG GTGACGCTGA CCGCCAAGCA GATCACCCCG
CGCACGAACG CCGACTACGG CGAGGTGCGC TGGGACGCCC TGGTCACCGA CCAGGACGGC
GAGGCCGTGG CCACCTACGA CGTGCTCACG CTGGTCGCCA AGGGCGGGGA GGGGTAG
 
Protein sequence
MPETLQSYAR GAWFTPADEG APLADANTGE TVARISSDGP DVAGMVDHAR TVGGPALRAL 
TFHQRANILK EVAKHLTGYK EEFYALSHRT GATARDTAVD VDGGFGTLFS FSSKGRRELP
NSTVILDGPL EPLGRQGTFV GQHVYTSRPG VAVQINAFNF PVWGMLEKFA PAFLAGMPSI
VKPAGQTAYL TVAVVRRMVE SGLLPEGSLQ LLVGSHRGLL DALGPQDVVG FTGSAATGAI
LRNHPNVVSG GVQLNVEADS LNCSILGPDV TEEDPEFDLY VKQVVTEMTV KAGQKCTAIR
RVIVPASMAE AVTGALTERL ARVVVGAADH PDTRMGALVS LDQREEVRKA VKALRATSEL
VYGDPESVEV AGADDESGAF MSPILLRAEP GAREPHEVEA FGPVSTVITY DGVAEAVELA
ARGSGSLVGS LVTRDPDVAR EVVLGLAPWH GRILVLNRDD AKESTGHGSP LPVLVHGGPG
RAGGGEEMGG VRGVKHHMQR TAVQAPPDMV TAITGHWTTG SERTVGDVHP FRKDLSQLRI
GDTIRSAERT VTRADIDHFA EFTGDTFYAH TDEEAAAANP LFGGIVAHGY LVVSLAAGLF
VDPAPGPVLA NFGVDNLRFL TPVKEDATIR VTLTAKQITP RTNADYGEVR WDALVTDQDG
EAVATYDVLT LVAKGGEG
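
The sequences above are printed in blocks of ten residues, so they need light cleaning before any calculation. The sketch below is an illustration only, not part of the original record. It strips the block spacing, checks that the gene begins with a bacterial start codon and ends with a stop codon, and defines a GC-fraction helper. The helper names and the codon sets are assumptions: the start set lists only the three most common initiation codons used with translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Assumed codon sets: common bacterial start codons and the three
# stop codons used by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First and last lines of the gene sequence, copied from the record.
first_line = "GTGCCCGAAA CACTCCAGAG CTACGCGCGG GGAGCATGGT TCACCCCCGC GGACGAGGGC"
last_line = "GAGGCCGTGG CCACCTACGA CGTGCTCACG CTGGTCGCCA AGGGCGGGGA GGGGTAG"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(head[:3] in START_CODONS)   # True (GTG)
print(tail[-3:] in STOP_CODONS)   # True (TAG)
print(round(gc_fraction(head), 2))
```

Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.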