Gene Ndas_5398 details

Gene Information

Locus tag: Ndas_5398
Symbol:
ID: 9249301
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 576701
End bp: 578224
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: glycosyl transferase family 2
Protein accession: YP_003683283
Protein GI: 297564310
COG category:
COG ID:
TIGRFAM ID:
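
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes 1-based coordinates with both endpoints included and an unspliced CDS that ends in a stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(576701, 578224)
print(length, protein_length_aa(length))  # prints "1524 507"
```

Under these assumptions the result matches the listed Gene Length (1524 bp) and Protein Length (507 aa).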


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.298595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTACCG CACGTTTCAT AGGCGGCCTG CTGATCGGGC TCATCGCCCT CGCCCTGGCC 
GCGGCACTCT TTGTCTACTG GTTCGGCTAC GCGACGGACG TGGCCGAGGC CTCCGGCCTC
GGCGCCCTGG GTTACGTGTT CCTGTGGTTG GCCTTCGGCG CCAACCTGCT GCTGTGGACC
ACCGTCGGAC TCGTGCGCCT GGGCGAGGAC TCGGTGCGCG CCGTGCTCCG GTCACCGCGC
GCCGGACACC GCGGCGCCGT CGGGGGCGGG GCCCGCGAAC GCGTCCTGGT GGGCGCGGGC
GGTTCCGGCG GCGCCGCACT CGCCGAGAGG GGCGGCGACG CCTCCGAGGG GGGCGCCTCG
GCGGCCGTGG CGGCCCGGGC GGAGGGTTCC GGGCGGGAGG TGTCCCTGGC CGTCATCATC
CCGGCGCACA ACGAGGAACC CGTCATCGGC GGCGCCATCG CCTCCGCCAT GGGGCTGTTC
GAACGCTGGG ACATCTACGT GGTCTCGGAC TCGTCGAGGG ACTCCACCGC CCAGATCGCG
GCCAAGACCG GCGTGAACGT CCTCGAACTC CTCGCCAACC GCGGCAAGGC CGGGGCCATC
GAGGCGGTCA TCGAGGAGTT CTCGCTGACC GACAACTACG ACGGCGTCCT CATCCTGGAC
GCCGACACCG AGCTCGACCC CGGGTACGTG GAGGGCGCCC GGAGGCAGCT GGCCGACCCG
TCGGTGGCGG CGGTCGCGGG CTTCGTCGTC TCGGAGTGGA AGCCCGGGGA GCGCGGTTTC
GTCGGCCGGA TGATCTCGGC CTACCGGGAC CGCCTGTACT TCATGCTCCA GTACCTCATG
CGCTTCGGGC AGACCTGGCG GCACGCCAAC ACGGCCTTCA TCGTGCCCGG CTTCGCCAGC
GTCTACCGCA GCGAGGCGCT CAGGGAGATC GACGTCAACC CCAAGGGGCT GGTGATCGAG
GACTTCAACA TGACCTTCGA GGTGCACCAC AAGCGCCTCG GCAGGGTCTC GATGAACCCC
GACACCAAGG CCTACAGCCA GGACCCGTTC ACCTTCCGGG ACTACTACAA GCAGGTCACC
AGGTGGACGC TGGGCTTCTG GCAGACCATC CGGCGCCACC GGGTGTGGCC GAGCCTGTTC
TGGGCCTGCC TGGCGCTGTA CATCCTGGAG GTGGTCCTGG TCTCCGCCGT GCTGCTGGTC
ACCACCGTGG TCGGCCTGTT CGTCCTGGCG GGGACGCTGG GCGGCGAGTT CTTCCTGAGC
CTGCCGTTCG TCGGGGAGGC CTTCACCGCG GTGACGGCCT TCCTGCCGCT GCTGGCGATC
GCCATCGGCC TGTTCATCCC GGACTACATG CTCACCTGCC TGATGGCGGC GATCCGGCGG
CGGCCGTCCT ACCTGGTCTA CGGCCTGCTC TTCTTCCCGA TCCGGCTCGT GGACGCCTAC
CTCGCGCTGC GGATGATCCC CAAGGCGTGG ACCACCGAGT CCGACGGCCG GTGGAGCAGC
CCGGACCGCG TCTCGGGCAG GTGA
 
Protein sequence
MRTARFIGGL LIGLIALALA AALFVYWFGY ATDVAEASGL GALGYVFLWL AFGANLLLWT 
TVGLVRLGED SVRAVLRSPR AGHRGAVGGG ARERVLVGAG GSGGAALAER GGDASEGGAS
AAVAARAEGS GREVSLAVII PAHNEEPVIG GAIASAMGLF ERWDIYVVSD SSRDSTAQIA
AKTGVNVLEL LANRGKAGAI EAVIEEFSLT DNYDGVLILD ADTELDPGYV EGARRQLADP
SVAAVAGFVV SEWKPGERGF VGRMISAYRD RLYFMLQYLM RFGQTWRHAN TAFIVPGFAS
VYRSEALREI DVNPKGLVIE DFNMTFEVHH KRLGRVSMNP DTKAYSQDPF TFRDYYKQVT
RWTLGFWQTI RRHRVWPSLF WACLALYILE VVLVSAVLLV TTVVGLFVLA GTLGGEFFLS
LPFVGEAFTA VTAFLPLLAI AIGLFIPDYM LTCLMAAIRR RPSYLVYGLL FFPIRLVDAY
LALRMIPKAW TTESDGRWSS PDRVSGR
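
The relationship between the two sequences can be sketched in code. Translation table 11 uses the standard codon assignments, and an alternative start codon such as GTG is read as methionine when it initiates the CDS. The helper names `clean`, `gc_fraction`, and `translate_cds` below are hypothetical, and the code assumes the input is given on the coding strand starting at the first codon position.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(block)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(block: str, starts_at_initiator: bool = True) -> str:
    """Translate in frame 1; unknown codons become 'X', a final stop is dropped."""
    dna = clean(block)
    usable = len(dna) - len(dna) % 3
    residues = [CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    if residues and starts_at_initiator:
        residues[0] = "M"  # assumption: the first codon is the start codon
    return "".join(residues)


first_row = "GTGCGTACCG CACGTTTCAT AGGCGGCCTG CTGATCGGGC TCATCGCCCT CGCCCTGGCC"
print(translate_cds(first_row))  # prints "MRTARFIGGLLIGLIALALA"
```

Applied to the first row of the gene sequence, this yields the first 20 residues of the protein sequence listed above. Passing `starts_at_initiator=False` translates an internal fragment without forcing the first residue to M.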