Gene Ndas_3430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3430 
Symbol 
ID9247297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4103155 
End bp4105050 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content74% 
IMG OID 
Productputative ABC transporter 
Protein accessionYP_003681341 
Protein GI297562367 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.790282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.301569 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGC CGTCGGCCGC ACCGGGGACC AACGGTCCCG GCGCTCCGGA CCGGTTCGGA 
ACCGGACCGG CCGCCACCTC CGAACAGCCC GCCCAGGGCT CCGCACCCGC GCGCGAGGCG
GACACCGAGG CCGTCGGCGC CCGACGGGGG CCCACCGTGC CGCTCGCCGC CGGACCGGCC
AAGCACCGCG CCCCCCAACC CGAAGAGAAC CACTTCGAAC AGGTCCTGGA AGCACTGCGC
AAGCACATCC GGGACCTGGA GTTCGCCGAG GGCCTGCCCA GTGACGAGGA GGGCCGCGCC
CTCCAGGCGG ACGTGCTCGC CCAGCTCTCC GACTACGTCC TGCCCCGGGT CCGCCGCCCC
GACATCCCGC TGCTGATCGC GGTCGCCGGA TCCACCGGAG CGGGCAAGTC CACCCTGGTC
AACAGCCTGG TCGGCGAACA GGTCACCACC ACGGGCGTGC GCCGCCCCAC CACCAACAGC
CCCGTCCTGG CCTGCAACCC CGCCGACGTC GACTGGTTCA GCGAGGCCTC CTTCATCCCC
TCCCTGCCGC GCGTGCGCCA GCAGGGCCTG GCCATGCCGG GCAAGGACGG CATGCTCGTG
CTCGCCGCCA CCGAGGCCAT GCCCCCGGGC GTGGCCCTGC TCGACACCCC CGACGTCGAC
TCCGCCGTCG CCGCGCACCA CGAGTTCGCG GCCAAGTTCC TCGACGCCGC CGACCTGTGG
GTGTTCGTCA CCACCAGCAG CCGCTACGCC GACGCGCGCG TGTGGGAGTT CCTCCAGGTC
GCCCGCGACC GCGACACCTC CCTGGCCGTC GTGCTCTCCC GGGTGCCGCG CAGGGGCAGG
CGACAGCTCC TGGACCACTT CGGCGCGATG CTGGAGGCCA ACGGCCTGGG CGCGGCCGCG
CGGTTCGCCA TCCCCGAGAC CGACCAGATC CGGGGCGAGC GCTTCACCTC CAACGTCGCC
GACCACATCC GCGACTACCT GGCGGAGGTC GCGGGCGAGG CCGAGCAGCG CGACCGCGTC
TCACGGCGCA CCTTCATCGG CGTCATCGAC AGCTTCCGGA CGCGGGTGCC CGAACTGGCC
CGCCAGGTGG AGGCGCAGAT CGAGACCGGC CGCACCCTCA CCGCCGCCGT CGACGACTCC
TACGCCGCCG CCCGCGACCG CGTCGACACC AGCCTGGCCG ACGGCTCCCT GCTGCGCGGA
TCGCTGCTGG CCCGCTGGCA GGAGGTGGCG GCCGGGGGAG AGCTCACCAG GAGCCTGCGC
CCGCGCGGCA GGCGCACGCG CCTGGGCGGC CGCGCCGAAC AGGCAGAGCG CGGCCAGCGG
GTCGGTGCCC TGGAGCGCGC CATCCGCGAC GGCCTGGAGG CCCTGGTGGT CTCCACCTGC
GAACGCGCCT CCGACGAGGT CGAGCGCAGG TGGCGCGAGG TCCCCGGCGG CGGCGACCTC
GCCTCCAAGG CCCTCAACCA ACCCGGCACC GGCCTGCTCT CACAGCAGAT CCGCCAGGAG
ATCACCGAGT GGCAGCGCGG CGTCGCCGAG ATGACCACCG CCAGCGGCGC CACCAAGCGC
TCCGTGGCGC GGTTCGTCAC CTTCGACCAC GACGTCGTCG CCCTGGTCCT GATCATCGAC
CTGCTCGGTT ACGAGCGGTC CCGCTCCGGA GGCGGCGCCC ACGCGGCCGA GGCCTCCGGC
CCTTCCCCGC AGCGCCTGCT CAAGGGGCTG TTCGGCGCCC AGTCACTGCG TAGCATTGGC
GCAGCGGCAC GGGACGACCT GCGCCAGCGC GTCCAGGGCC TGCTCGACCG CGAGCGCGTG
CCCTTCGACA GCGCCCTGGC GTCTGCCGCG ATCCCGTCCG AGGACACCGC CGTCCAGCTC
TACCAGGCCA CGTACAACCT TGAGATTGCA CGATGA
 
Protein sequence
MTRPSAAPGT NGPGAPDRFG TGPAATSEQP AQGSAPAREA DTEAVGARRG PTVPLAAGPA 
KHRAPQPEEN HFEQVLEALR KHIRDLEFAE GLPSDEEGRA LQADVLAQLS DYVLPRVRRP
DIPLLIAVAG STGAGKSTLV NSLVGEQVTT TGVRRPTTNS PVLACNPADV DWFSEASFIP
SLPRVRQQGL AMPGKDGMLV LAATEAMPPG VALLDTPDVD SAVAAHHEFA AKFLDAADLW
VFVTTSSRYA DARVWEFLQV ARDRDTSLAV VLSRVPRRGR RQLLDHFGAM LEANGLGAAA
RFAIPETDQI RGERFTSNVA DHIRDYLAEV AGEAEQRDRV SRRTFIGVID SFRTRVPELA
RQVEAQIETG RTLTAAVDDS YAAARDRVDT SLADGSLLRG SLLARWQEVA AGGELTRSLR
PRGRRTRLGG RAEQAERGQR VGALERAIRD GLEALVVSTC ERASDEVERR WREVPGGGDL
ASKALNQPGT GLLSQQIRQE ITEWQRGVAE MTTASGATKR SVARFVTFDH DVVALVLIID
LLGYERSRSG GGAHAAEASG PSPQRLLKGL FGAQSLRSIG AAARDDLRQR VQGLLDRERV
PFDSALASAA IPSEDTAVQL YQATYNLEIA R