Gene Ndas_1035 details


Gene Information

Locus tag: Ndas_1035
Symbol:
ID: 9244881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1274466
End bp: 1276325
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: ABC transporter related protein
Protein accession: YP_003678984
Protein GI: 297560010
COG category:
COG ID:
TIGRFAM ID:
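
The start, end, and length values in this record are mutually consistent, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive (the record does not say so, but the numbers agree with that convention) and that the protein length excludes the stop codon. The function names are illustrative only and are not part of the database.

```python
# Coordinates copied from the record above.
START_BP = 1274466
END_BP = 1276325


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1860
print(protein_length(length_bp))  # 619
```

Both results match the Gene Length (1860 bp) and Protein Length (619 aa) fields of the record.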


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00381243
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0281832
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACCG CACAGGACAC CTCCCCGCGC ACACAGGCCG CGCCGGAGCC GGGGACCGGG 
CGCGCGCCCT CCCCGCTCGG CGTCCTCCTG GAGCCGATCC GGGGGCGCGT CCGCGCTGCC
GTGGTCCTCC AGGGCCTGGC CAGCGCGCTC GGCCTCGTCC CCCTGATCTG CCTGGCCGAG
GCCGCCGCCG CGCTGCTGGC CGACGGGCCG ACCGACACCG CCCTGGTGTG GACGCTGGTC
GCGGTCGCGC TGGCCGGAGC GGTCGCCGCC CTGGCCGCGG GCACCGCCTC CACCCTGGTC
GGGCACCTGG CCGACAACGA CATGCAGCTG TCCGTGCGGC GCGCCCTGGC CCGCCACCTG
GGGCGGGTTC CGCTGGGCTG GTTCTCCGGC CGCGGCTCGG GCCGGGTCAA GAAGGCCCTG
CACGACGACA TCGAGGACGT CCACTCCCTG GTCGCCCACA CCCTCCCCGA CCTGGCCGCC
GTCGTGGCGG TCCCCGTGGT CGCCCTGGCC TACCTCGCCT CCGTGGACTG GCGCCTGACC
CTGGTCGCGG TGCTGCCGGT GGCCGCGGGA GTCCTGCTGT TCGCGCGGGC CATGGCCGGG
TCCATGAAGA AGATGGCCGA GTACGCCGAG GCGATGGGCG CGGTCAACAC CGCGGCCGTG
GAGTTCGTGG ACGGCATCCA GGTGGTCAAG CACTTCGGCG GCCACCGCAG GGCCCACGAG
CGCTTCACCC GGGCCGCCGA CGCCTTCGCG GACTTCTTCG TCTCCTGGTC GCGCGCCACC
ACCCCGGCCA CGGTCGCCTC CTTCCTGGTG CTGTCGGCGC CCACGGTCAC CGTCACCGTG
GTGGCGGCGG GGGCGGGCTT CGCCGCCCTG GGCTGGAGCG AACCCGTCTC GGTGGTGGCC
TTCGCCCTGC TGGCCCCTGC GCTGTGCGCG CCGATGAACG TGATCGGCTC GCGGGTGCAG
CAGATCCAGA CGGCGCGGGC CGCCGCCGGA CGGGTCCGCG ACCTGCTCGC CACCCCGCCC
CTGCCCGAGG GCTCCGGCGG GGGACCGCGG GGCGCCCGCG TGGTCTTCGA CGGGGTGCGC
TTCGCCTACC CGGCCCAGGA CGGCGCCGAG GGGAGCGGAG CGGGCGAGGA GGTCCTGCGC
GGCGTGGACC TGGTCCTGGA ACCGGGCACC GTGACCGCCC TGGTCGGCCC GTCCGGCGCG
GGCAAGACCA CCCTGGCCAC CCTGCCCGCG CGCTTCGCCG ACGTCACGGC GGGCGCGGTC
ACCGTGGGCG GGGCGGACGT GCGCGACATC CCCGCCGAGG AGCTGTACCG CACGGTCGGC
TTCGTGTTCC AGGACGTGCG GCTGCTGCGG GAGACCGTGG CCGACAACAT CGCCATGGGC
CGCCCCGGGG CGACCCGTGA GGAGGTGGAG GAGGCCGCCC GCGCCGCCCG CGTCGCCGAA
CGGATCGAGG CGCTGCCGCG CGGCTACGAC TCCGTGGTGG GCGAGGACGC CGACCTGTCC
GGCGGTGAGG CCCAGCGGGT CTCCATCGCC CGCGCCCTGC TCGCCGACAC CCCCGTGCTG
GTGCTGGACG AGGCCACCGC CGCCGTGGAC CCGGTGTCGG AGGCGGCCAT CCAGGACGCC
CTCGGCGAAC TGGCCCGGGG CCGCACGGTC CTGGTGATCG CGCACCGGCT GAGCACCGTG
GCCGGGGCCG ACCTCATCGC GGTCATGGAC GAGGGGAGCG TGGTGGAGCG CGGCACCCAC
GGCGAGCTGC TGGCGCGCGG CGGCCGGTAC GCCGACCTGT GGCGCGCCCA GCACCCGGAG
GCCGGTGACG GGGACGCCCA TGACGGGGCC GCACGAGACG AACCGGGAGA AGACCAGTGA
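
The record reports a GC content of 76% for this gene. The sketch below shows one common way such a figure is computed: the share of G and C bases among all bases, after removing the spaces and line breaks used to group the sequence into blocks of ten. The function name `gc_content` is a hypothetical helper, and the database's exact method is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First 10-base block of the gene sequence above: 4 G + 3 C out of 10 bases.
print(gc_content("GTGAGCACCG"))  # 70.0
```

Passing the full gene sequence as one string gives the value for the whole gene, which can then be compared with the GC content field.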
 
Protein sequence
MSTAQDTSPR TQAAPEPGTG RAPSPLGVLL EPIRGRVRAA VVLQGLASAL GLVPLICLAE 
AAAALLADGP TDTALVWTLV AVALAGAVAA LAAGTASTLV GHLADNDMQL SVRRALARHL
GRVPLGWFSG RGSGRVKKAL HDDIEDVHSL VAHTLPDLAA VVAVPVVALA YLASVDWRLT
LVAVLPVAAG VLLFARAMAG SMKKMAEYAE AMGAVNTAAV EFVDGIQVVK HFGGHRRAHE
RFTRAADAFA DFFVSWSRAT TPATVASFLV LSAPTVTVTV VAAGAGFAAL GWSEPVSVVA
FALLAPALCA PMNVIGSRVQ QIQTARAAAG RVRDLLATPP LPEGSGGGPR GARVVFDGVR
FAYPAQDGAE GSGAGEEVLR GVDLVLEPGT VTALVGPSGA GKTTLATLPA RFADVTAGAV
TVGGADVRDI PAEELYRTVG FVFQDVRLLR ETVADNIAMG RPGATREEVE EAARAARVAE
RIEALPRGYD SVVGEDADLS GGEAQRVSIA RALLADTPVL VLDEATAAVD PVSEAAIQDA
LGELARGRTV LVIAHRLSTV AGADLIAVMD EGSVVERGTH GELLARGGRY ADLWRAQHPE
AGDGDAHDGA ARDEPGEDQ
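
The gene sequence starts with GTG while the protein starts with M. This follows from translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is an accepted start codon and an initiating codon is translated as methionine. The sketch below is a minimal, dependency-free illustration of that rule. `translate_cds` is a hypothetical helper, not a database or library function, and it assumes the input is a complete forward-strand CDS.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a complete CDS with table 11, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned or len(cleaned) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [cleaned[i:i + 3] for i in range(0, len(cleaned), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # an initiating codon is read as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCACCG CACAGGACAC CTCCCCGCGC"))  # MSTAQDTSPR
```

The output matches the first ten residues of the protein sequence listed in this record.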