Gene Ndas_3231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3231 
Symbol 
ID9247088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3863018 
End bp3864808 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content76% 
IMG OID 
Productprotein of unknown function DUF1446 
Protein accessionYP_003681143 
Protein GI297562169 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.113119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGCTTGT TTTTCCGTGT GATCGGTGTC ACCCTCGTGT CCATGACCGC TGCTGCCACC 
CCGGCGCCCC CGCTGCTGGT CGCGAACGCC TCCGGTTTCT ACGGGGACCG CTTCGCCGCC
GTCCACGAGA TGCTCACCGA GGGGCGCGTG GACGTCCTCA CCGGCGACTA CCTGGCCGAG
CTCACCATGG CCATCCTCGG CCGCGACCAG CTCGGCGACC CCGGCCGCGG CTACGCCCGG
ACCTTCCTGA GCCAGATGCG CGAGACCCTG GCGCTGGTCA TGGAACGCGG CACCAGGGTG
GTCACCAACG CGGGCGGCCT CAACCCGCGC GGCCTCGCCG ACCGGCTCAC CGAACTCGCC
GACGGGCTCG GCCTCGAACC CCGCATCGCC TGCGTCACCG GCGACGACCT CATCGACCGC
GCCGAGGAAC TCGACCTGGG CACCCCCCTG ACCGCCAACG CCTACCTGGG CGCCTTCGGC
ATCGCCGCCT GCCTGGAGGC GGGCGCCGAC ATCGTCGTCA CCGGCCGCGT CACCGACGCC
TCCCTGGCCG TGGGCCCGGC CGCCGCCCAC TTCGGCTGGA CCCCCGGGGA CCTGGACGCC
CTGGCCGGGG CCACCGTCGC CGGGCACGTC ATCGAGTGCG GGACCCAGGC CACCGGCGGC
AACTACGCCC TGGCCGCCGA ACTCCTGCGC GAGGGCCGCG ACCTGGACCG GCCCGGCTTC
CCCCTCGCCG AGATCCACGC CGACGGCAGC GCGGTCATCA CCAAGCACCC CGGCACCGGC
GGCGCCGTCA CCACCGGGAC CGTCACCGCC CAGCTCGTCT ACGAGGTCGC CGGAGCCCGC
TACCCCGGCC CCGACGTCAC CGCCCGCCTG GACACCGTGC GCCTCACCAG GCAGGGGCCC
GACCGCGTCC TGCTCAGCGG AACCCGCGGC GAGGAACCTC CCCCCGACCT CAAGGTCGGA
CTGACCAGCC TCACGGGCTT TCGCAACGAG GTCGAGTTCC TGGTCACCGG CCTGGACGCC
GGGGCCAAGG CCGCACAGGC CGAACGCCAG ATGCGCGCAG CCCTCGCCGA CCGCGCGCCC
GACGACCTGC GCTTCACCCT CGTGCCAGCC CAGGACCCCC ACGGCGACAC CCAGGACGCG
GCCACCGCAC GCCTGCGGGT GGTCGCCCGC GACCACGACC CCGCGGTCGT GGGCCGCTCC
TTCGGGGCGG CCGCCGTGGA GCTGGCCCTG GGCAGCTACG CCGGATTCCA CCTCACCGCG
CCGCCGCGCG AGGCCCGACC CGACGGGGTC GCCGCCACGC ACGCGCTCGT GCCCGCCTCC
GAGGTCGCCC ACACCGCGAT CCTGCCCGAC GGCGCCAGGA TGCCCGTCGC GCCCGCGCCC
CGCACCCGGG CCCTGACCGG GGTCGCCGAG CCCCCGCTGC CCGAACCCCT GCCCCGAGGC
CCCGCGCGCC CGGTCCCGCT CGGCCTGGTC CTGGGCGCGC GCAGCGGCGA CAAGGGCGCC
GACGCCAACC TGGGCGTGTG GGTGCGCGGA GAGACGGCGT GGCGGTGGCT GGCCACCACC
CTGACCGCCG ACCTGCTGCG CGAACTCCTG CCCGAGACCG CCGGACTGCG CGTCACCCGG
CACCTGCTGC CCAACCTGCG GGCCGCCAAC TTCTGGATCG AGGGCCTGCT CGCCCCCGGC
ACCGCGCGCC GCGAGGGCGT GGACCCCCAG GCCAAGGGGC TGGGCGAGTG GCTGCGCGCC
CGCCGCGTCC CCGTCCCCGA GACCGTGTTG GCGGAGGTGG AGCAGCCGTG A
 
Protein sequence
MCLFFRVIGV TLVSMTAAAT PAPPLLVANA SGFYGDRFAA VHEMLTEGRV DVLTGDYLAE 
LTMAILGRDQ LGDPGRGYAR TFLSQMRETL ALVMERGTRV VTNAGGLNPR GLADRLTELA
DGLGLEPRIA CVTGDDLIDR AEELDLGTPL TANAYLGAFG IAACLEAGAD IVVTGRVTDA
SLAVGPAAAH FGWTPGDLDA LAGATVAGHV IECGTQATGG NYALAAELLR EGRDLDRPGF
PLAEIHADGS AVITKHPGTG GAVTTGTVTA QLVYEVAGAR YPGPDVTARL DTVRLTRQGP
DRVLLSGTRG EEPPPDLKVG LTSLTGFRNE VEFLVTGLDA GAKAAQAERQ MRAALADRAP
DDLRFTLVPA QDPHGDTQDA ATARLRVVAR DHDPAVVGRS FGAAAVELAL GSYAGFHLTA
PPREARPDGV AATHALVPAS EVAHTAILPD GARMPVAPAP RTRALTGVAE PPLPEPLPRG
PARPVPLGLV LGARSGDKGA DANLGVWVRG ETAWRWLATT LTADLLRELL PETAGLRVTR
HLLPNLRAAN FWIEGLLAPG TARREGVDPQ AKGLGEWLRA RRVPVPETVL AEVEQP