Gene Ndas_4076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4076 
Symbol 
ID9247948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4874248 
End bp4875882 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content70% 
IMG OID 
Productglycosyl transferase family 39 
Protein accessionYP_003681978 
Protein GI297563004 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.639567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.932979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA CCGCACCTTT CTCCGAGGCA GAGCCCTCGC CCCCGCCGCG GTCGGCGCGG 
ATACGCGCCC GCTTCCTGCC CGTCAACCCC GCGCCCCGCT GGCTGGGCTG GCTGGGCGCC
TTCGCCGTCG CGCTGTTCGC CGGGGTGCTG AGGTTCTTCA ACCTCGGCCA GCCCGACCGG
ATCTACTTCG ACGAGACCTA CTACGCCAAG GACGCCTACG GGCTCTGGAA CTTCGGCTAC
GAGCACGAGA CCGTCGAGGA GCCCGTCGAG GCCGACGACC TGCTCGCCCA GGGGTACCAG
GACATCTTCA CCGGCACGGG CGACTTCATC GTGCACCCGC CGGTGGGCAA GTGGATGATC
GCCCTGGGCG ACTGGCTGTG GTCGCTCCTG CCGTTCGGCA CGACCATGAC CCCCGAGGCC
TGGCGGTTCG CCTCCGCCGT GGCCGGGGTG CTCTCGGTCC TCATCCTGGT GCGGCTGGCC
ACGCGGATGA CCCGCTCGGT GCTGCTGGGC TGCACGGCGG GGCTGATCAT GGCGCTGGAC
GGCCTGCACT TCACGCTGAG CCGCATCGCC ATGGTGGACA TCTTCCTGAC CCTGTGGATC
CTGGCCGGAT TCGCGTGCCT GGTGATCGAC CGGGACAGCA CCCGGGAGCG GATGGCCCGC
CTGGCGGAGG CGGGAGGGGA CCTGGCGTCG GTGGGCTGGC TGGGGATGCG CTGGTGGCGG
CTGGCGGCCG GTCTGTGCTT CGGCCTGGCG GTGGGCACCA AGTGGTCGGC CCTGTTCTTC
GTCGCGGCGT TCGGCCTGCT CACGGTGGCG TGGGACTACG GGGCCCGCAG CAGCGTCGGC
CAGCGCGGCT TCTTCTGGCG GTGGCTGGGC GTCGACGCCG TCCCGGCGTT CGTGCAGACC
GTGGTGGTCG CGGGGGTCGT CTACCTGGTC TCGTGGTCGG GGTGGCTGTT CACCCGCGGC
GGCTACAACC GCGACTTCGC GGACGGCATG GCGCCCGAGT GGGTGCCCGG GTTCCTGCGG
GCGCCGGTGG AGGCGCTGTG GAGCCTGGTG GACTACCACC AGCGGATGAT GACCTTCCAC
ACCGACCTGA CCAGCGACCA CGCCTACATC TCCGCGCCGT GGGAGTGGCT GGTCATGCGC
ACCCCGGTGA TGTTCCACTA CAACGGCGAG GTCGCCTCGT GCGACACGGG CGACTGCGTC
ACCTCCGTGG TCTCCATCGG CACGCCGGTC ATCTGGTGGT CCAGCCTGCT CGCGCTGGCG
GTGATGCTCG GCTGGTGGGT GACCTTCCGC GACTGGCGGG CCGGGGCGGT GCTGCTGGCC
GTGGCCGCGG GGTGGCTGCC GTGGTTCGCC TACCCGGACC GGCCCATGTT CCTGTTCTAC
GCCGTCCCGC TGCTGCCCTT CCTGGTCCTG GCGATCGTGC TGGCGCTGGG CCTGGCGATG
GGGGCCGGGG AGGACAGTCC GCGCTTCGCC CCCTACACGC GTGCGGTGGG CGGCATCGTC
TACGGCGTGG TCCTGCTGTT GATCGTGGCC AATTTCGCCT ACTTCTACCC GGTGTTGTCG
GCGTATCCGA TCGACGAGGG TATGTGGCGT GAACGCATGT GGTTCGACGT GTGGATCTAC
GGCAGCGGCG GTTAG
 
Protein sequence
MTTTAPFSEA EPSPPPRSAR IRARFLPVNP APRWLGWLGA FAVALFAGVL RFFNLGQPDR 
IYFDETYYAK DAYGLWNFGY EHETVEEPVE ADDLLAQGYQ DIFTGTGDFI VHPPVGKWMI
ALGDWLWSLL PFGTTMTPEA WRFASAVAGV LSVLILVRLA TRMTRSVLLG CTAGLIMALD
GLHFTLSRIA MVDIFLTLWI LAGFACLVID RDSTRERMAR LAEAGGDLAS VGWLGMRWWR
LAAGLCFGLA VGTKWSALFF VAAFGLLTVA WDYGARSSVG QRGFFWRWLG VDAVPAFVQT
VVVAGVVYLV SWSGWLFTRG GYNRDFADGM APEWVPGFLR APVEALWSLV DYHQRMMTFH
TDLTSDHAYI SAPWEWLVMR TPVMFHYNGE VASCDTGDCV TSVVSIGTPV IWWSSLLALA
VMLGWWVTFR DWRAGAVLLA VAAGWLPWFA YPDRPMFLFY AVPLLPFLVL AIVLALGLAM
GAGEDSPRFA PYTRAVGGIV YGVVLLLIVA NFAYFYPVLS AYPIDEGMWR ERMWFDVWIY
GSGG