Gene Ndas_2749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2749 
Symbol 
ID9246600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3290728 
End bp3292413 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content74% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003680668 
Protein GI297561694 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.308133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCTACA TCGACGTCAA CCAGGTCAGC CACACCCTGC CCGACGGCAG GGTCCTGCTG 
GACTCGGTGT CCTTCCGCGT CGGAGAGGGG TCCAAGACGG CGCTGGTGGG CGCCAACGGC
GCGGGCAAGA GCACCCTGCT GCGGATCGTG CGCGGCGAGC AGCGCCCCCA GGGCGGCGCG
GTCACCCTGG ACGGCGAACT CGGCGTGATG CCGCAGTTCA TCGGGCACGT GCGCGACGAG
ACCACCGTGC ACGAACTGCT GGTGTCGGTC TCCCCCGCTC CGGTGCGCGA GGCGCACGCG
GACCTGGAGG CGGCCGAGGC GGCCATGATG GTCGACGACG GCGAGAGGAC GCAGATGCGC
TACGCCGCGG CCATCGCCCG GTACGGCGAC GCGGGCGGCT ACGACGCCGA GGTCGTGTGG
GACCAGTGCT GCACGGCGGC GCTGGGCGTC CCCTACGAGC GCTGCCGGTG GCGCGAGGTG
CGCACGCTGT CGGGCGGGGA GCAGAAACGC CTGGTCATCG AGGCCCTGCT GCGGGGCCCG
GCGCCGGTGC TGCTGCTGGA CGAGCCGGAC AACTACCTCG ACGTTCCGGG CAAGCGCTGG
CTGGAGGAGG CGCTGCTGGC CACCCCCAAG ACGGTCCTGT TCGTCTCGCA CGACCGCGAG
CTGCTGAGCC GGGTGCCGGA CCGGATCGTC ACGGTCGAAC TGGGCGCCGT GGGCAACACG
GCGTGGGTGC ACGGGGGCGC GTTCGACACC TACCACGACG CCCGCCGGGA GCGGTTCGCG
CGGCTGGACG AGCTGCGGCG GCGCTGGGAC GAGGAGCACG CCAAGCTCAG GGCGCTAGTG
GCCGCCTACA AGCAGAAGGC CGCCTACAAC TCCGACATGG CCTCGCGCTA CCAGTCGGCG
CGGACCCGGC TGCGCAGGTT CGAGGAGGCG GGGCCGCCGC AGAAGCTGCC CCGCGAGCAG
AACCTGCGGA TGCGGCTGCG CGGCGGCCGG ACGGGCAGGC GGGCGCTGGT GTGCGAGGGG
CTGGAGCTGA CCGGGCTGAT GCGCCCGTTC GACCTGGAGG TCTGGTACGG CGAGCGGGTG
GCGGTGCTGG GCTCCAACGG GTCGGGCAAG TCGCACTTCC TGCGGCTGGC GGCCGGGGGC
GGCAGCGATC CCCAGTCGTC GGTGCTGCCG GAGGAGGACA GGGCGAAGGT GCCGCCGGTG
GAGCACAGCG GCACGGCGCG GCTGGGCGCG CGGGTGCTGC CGGGATGGTT CGCGCAGACC
CACGAGCACC CCGAGTTCGC GGGGCGTACG CTGCTGGACA TCCTGCACCG GGGGCAGGGC
GCCCGCCGGG GCGTGCCCCG CGACGAGGCG GGGCGGGCGC TGGACCGCTA CGAGCTGGCT
CCGGCCGCGG AGCAGACCTT CGACACGCTC TCGGGCGGTC AGCAGGCGCG GTTCCAGATC
CTGCTGCTGG AGCTGTCAGG GGCGACGATG CTGCTGCTGG ACGAGCCCAC GGACAACCTG
GACGTGGTGT CGGCGGAGGC GCTGGAGTCG GCGCTGGAGG CCTACGACGG CACGATCCTG
GCGGTGACGC ACGACCGCTG GTTCGCGCGC GGGTTCGACC GGTTCGTGGT GTTCGGCGCC
GACGGAGGCG TGTACGAGGC GCCGGAGCCG GTGTGGGACG AGGGGCGGGT GCGCCGCGAA
CGGTAG
 
Protein sequence
MGYIDVNQVS HTLPDGRVLL DSVSFRVGEG SKTALVGANG AGKSTLLRIV RGEQRPQGGA 
VTLDGELGVM PQFIGHVRDE TTVHELLVSV SPAPVREAHA DLEAAEAAMM VDDGERTQMR
YAAAIARYGD AGGYDAEVVW DQCCTAALGV PYERCRWREV RTLSGGEQKR LVIEALLRGP
APVLLLDEPD NYLDVPGKRW LEEALLATPK TVLFVSHDRE LLSRVPDRIV TVELGAVGNT
AWVHGGAFDT YHDARRERFA RLDELRRRWD EEHAKLRALV AAYKQKAAYN SDMASRYQSA
RTRLRRFEEA GPPQKLPREQ NLRMRLRGGR TGRRALVCEG LELTGLMRPF DLEVWYGERV
AVLGSNGSGK SHFLRLAAGG GSDPQSSVLP EEDRAKVPPV EHSGTARLGA RVLPGWFAQT
HEHPEFAGRT LLDILHRGQG ARRGVPRDEA GRALDRYELA PAAEQTFDTL SGGQQARFQI
LLLELSGATM LLLDEPTDNL DVVSAEALES ALEAYDGTIL AVTHDRWFAR GFDRFVVFGA
DGGVYEAPEP VWDEGRVRRE R