Gene Ndas_5460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5460 
Symbol 
ID9249363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp647691 
End bp649730 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content70% 
IMG OID 
Productband 7 protein 
Protein accessionYP_003683345 
Protein GI297564372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.306452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.831965 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGCTGA GTGCCACCGC GAACGGCCAG GAGGCCGCGG ACGCGGCCTT CCTGACCGTC 
GGCGTCGCCC TTGCCGTCGG CGTCGTCGTC CTCATCGCCC TGCTCCTGAT GGTCACCCGC
CTGTTCCGCA AGGTCGAGCA GGGCAAGGCG CTGATCATCT CCAAGGTCCG CGACGTCCAC
GTCACCTTCA CGGGCGCGAT CGTCCTGCCG GTCCTGCACA AGGCCGAGAT CATGGACATC
TCGCTCAAGA CGCTGACCGT GGACCGGCGC GGCCGTGAGG GCCTGACCTG CAAGGACAAC
ATCCGCGCGG ACATCAAGGT GTACTTCTAC GTCCGGGTCA ACGAGACCGC GGAGGACGTC
AAGAAGGTCG CCAAGTCCAT CGGCACCGAG CGCGCCAGCC ACCAGGACAC GCTCCAGGAG
CTGTTCAACT CCAAGTTCTC CGAGGGCCTC AAGACCGTCG GCAAGCAGTT CGACTTCGAG
GAGCTGTTCA CGCACCGCGA GCAGTTCCGT CAGGCGATCA TCTCCCTGAT CGGCACCGAC
CTCAACGGCT ACAGCCTGGA GGACGTGGCC ATCGACGAGC TGGAGCAGAC CCCGCTGCAC
CAGCACGACG CCAACAACAT CCTCGACGCG CAGGGCATCT CCAAGATCAC CGAGCGCACC
GCGATCGAGC ACAAGCGCAC CAACGAGTTC GAGAACGACC GCCGCAAGGA ACTCGACCGG
CAGAACACCG AGACCGCCGA GACCCTCGCC GAGCTGGAGA AGCGCCGCGA GGAGGCCGCC
GCCAAGGCCA AGCGCGAGAT CGAGATCATC CGCGCCCGCG AGGAGGCCGA GACCGCCCGC
GTGCAGGCCG AGGAGCGCCT CAAGGCCGAG ACGGCCAACA TCCGCACCTC CGAGGCCCTG
GGCGTGCAGC ACCAGAACCA GCAGCGCGAG ATCGCGGTGG CCGAGAAGAA CCGCGAGCGC
GTCATCGCCA TCGAGAGCGA GCGCATCGAG AAGGACCGCC TCCTGGAGGT CATCGGGCGC
GAGCGCGAGA CCGAGCTGAG CCGCATCGCC AAGGACAAGG AGGTCGAGGC CGAGAAGCGC
GAGGTCGCCG ACGTCGTCCG CGAGCGGATC GCCGTGGACC GCACCGTGGC CGAGCAGGAG
GAGGCCATCA AGCGCCTGCG CGCGGTCGAG GAGGCCGAGC GCACCCGCCA GGCCGTCATC
ATCCAGGCCG AGGCCGAGGC CCAGGAGAAC CTGGTCAAGG ACATCAAGGC CGCCGAGGCC
GCGGAGGCCG CCGCCAAGCA CCGGGCCGCC GAGGAGCTCA CCCTGGCCGA GGCCCGCCAG
CAGGCCGCCG AACTGGACAC CCGCGCCAAG ATCCGCCTCG CCGAGGGGAT CCAGGCCGAG
GCCGCCGCCA CCGGCCTGGC CGAGGTGCAG GTCCGCCAGC AGGACGCCGA GGCCATCGAG
AAGGTCGGCC GCGCCGAGGC CGCCGTGGCC AGCGAGAAGG CCCGCGTCGA GGCCGAGGCC
GTGGAGCAGA AGCTGCGCGC CGAGGCCGCC GGTCTCACCG ACAAGGCCGA GGCCATGGCC
GCCCTGGACC AGGTCAGCCG CGAGCACGAG GAGTACCGCC TGCGCCTGGA GGCCGAGAAG
GAGATCCGCC TGGCCGGGAT CAACGTGCAG CGCGAGGTCG CCGAGGCCCA GGCCACCGTG
CTGGCCACCG GCCTGGAGAA CGCCGACATC AACATCGTCG GCGGCGACGG CGCCTTCTTC
GACCGCATGG TGGGCTCCAT CGGCCTGGGC AAGGCCGTGG ACGGCTTCGT CGGCAACTCC
AAGACCGTGC AGGCGCTGGG CGGCAACTGG CTCAACGGCG AGGGCGACTT CGCCGCGGAC
ATGCGCAGGG TGATGGAGTC GGTGTCCACC GAGGACGTCA AGAACCTCAC CGTCTCCGCG
CTGCTGCTCA AGCTGATCGG CGCGGGCGGC CCGCAGGCCG ACAAACTCGA CGGCCTGCTC
GGCACCGCCC GCTCGCTGGG CGTGGACCAG CTGCCCGCCT CCGCCCTCGC CAAGCAGTAG
 
Protein sequence
MLLSATANGQ EAADAAFLTV GVALAVGVVV LIALLLMVTR LFRKVEQGKA LIISKVRDVH 
VTFTGAIVLP VLHKAEIMDI SLKTLTVDRR GREGLTCKDN IRADIKVYFY VRVNETAEDV
KKVAKSIGTE RASHQDTLQE LFNSKFSEGL KTVGKQFDFE ELFTHREQFR QAIISLIGTD
LNGYSLEDVA IDELEQTPLH QHDANNILDA QGISKITERT AIEHKRTNEF ENDRRKELDR
QNTETAETLA ELEKRREEAA AKAKREIEII RAREEAETAR VQAEERLKAE TANIRTSEAL
GVQHQNQQRE IAVAEKNRER VIAIESERIE KDRLLEVIGR ERETELSRIA KDKEVEAEKR
EVADVVRERI AVDRTVAEQE EAIKRLRAVE EAERTRQAVI IQAEAEAQEN LVKDIKAAEA
AEAAAKHRAA EELTLAEARQ QAAELDTRAK IRLAEGIQAE AAATGLAEVQ VRQQDAEAIE
KVGRAEAAVA SEKARVEAEA VEQKLRAEAA GLTDKAEAMA ALDQVSREHE EYRLRLEAEK
EIRLAGINVQ REVAEAQATV LATGLENADI NIVGGDGAFF DRMVGSIGLG KAVDGFVGNS
KTVQALGGNW LNGEGDFAAD MRRVMESVST EDVKNLTVSA LLLKLIGAGG PQADKLDGLL
GTARSLGVDQ LPASALAKQ