Gene Ndas_1575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1575 
Symbol 
ID9245425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1926748 
End bp1928253 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content79% 
IMG OID 
Productcytidyltransferase-related domain protein 
Protein accessionYP_003679510 
Protein GI297560536 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.528601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0217414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGGG GAACCGTGGT GGTGGTCGGC GACGCCCTGC TCGACGTGGA CCTGCGCGGC 
GTGTCCCGGC GCGACTGCCC CGACGTGCCC GCGCCCGTGC TGGAGGAGCC CGAGCCCTGG
TACCGGCCCG GAGGCGCGGC CCTGGCCGCC CGCCGCGCCC GGCAGGACGG CCGCGACGTG
GTCCTGGTGA CCGCCGTGGG CCGCGACGCG GCCGCCGACG AACTGGCCGC GCTCGTGGGG
GAGGGCGTGC GGCTGGTGGG GCTGCCCCTG ATGGAGCACA CGCCCACCAA GACCCGGGTC
CAGGCCAACG GCCGCACCGT GGCCCGTCTG GACCAGGGGT GCGAGGGCGT GGAACTCGAC
GCCCGCGCCG AGGACGTCGC CGAGGCCCTG GCCGGGGCGG CGGCCGTGCT CGTGGCCGAC
TACGGGCACG GGCTCACCCG CCAGCGCGCC GTACGCCGGG CCCTGGCGGC CTGCGCGCGG
CGCGGCGTCC CGCTCGTGTG GGACCCGCAC CCGCGCGGCG CCGACCCCGT GCCCGGAACC
CGCCTGGCCA CGCCCAACGC CGCCGAGGCC GGGGTCGCCG GGGGGACCGG CGAGCGGGCC
CTGCGCCGGG CCGGTGAGCT GGCCGGGGCG TGGGACGTGC ACTCGGTGGC CGTCACCCTC
GGCGCGCGCG GCGCCGCCTG GTCGGACGCC GGGGGCGGGT GCGCCCTCCT GCCGGGCACA
CCCGTCGACG CGCCGACCGA CACCTGCGGC GCCGGGGACG CCTTCGCCGC CGCCTGCGCC
ACGGCGCTCG CCGACGGCGA CGACGTACGC GACGCCGTCC GACGCGGGGT CGCCTCCGCC
TCGGCCTTCG TCGCCGGGGG CGGGGCCTCG GCCTACGCCG CCCCGAGCAC CGCCGCCGGG
CGGGCGCGCG CCGCCGCGGT GCCCGGCCCC CGCGCGGGCG AGGGCGGCCG GGCGGAGAGG
GTCGTGGCGA CCGGCGGCTG CTTCGACGTC CTGCACGCGG GCCACGTCGA CCTGCTGCGC
CGCGCCCGCG CCCTGGGCGA CCGCCTCGTG GTCCTGCTCA ACAGCGACGC CTCCGTGCGC
GCGCTCAAGG GCAGCGGCCG TCCGGTGGTC GCCGAACAGG ACCGCGCCCG CGTGCTGGGC
GCGCTCGACT GCGTGGACGA GGTGGTCGTC TTCGACGAGG ACACGCCCGT GCGCGCCCTG
GAGGAGCTGC GGCCCGACGT GTGGGTCAAG GGCGGCGACT ACGAGGTGGA GGACCTGCCC
GAGACGCCCG TCGTGCGGAG GGCCGGGGGG GGAGGTGGTC ACCGTGCCCC TCGTGCCCGG
CCACTCCACG ACCGGGCTGT TCACCCGCAT CCGCGGCCGG GGCCGCGACG GGGTACCCGC
CCGCTGAGGG CCCCGGGACC GGACGCCCCG GCCCCGGAGC CCGGGAAGTC GGAAACGAAC
GACGGACAGC GACGAACGAC GACGGAGAAC GAGGGAGACA GATGCGTCCA CTCGGAAACA
CGCTGA
 
Protein sequence
MSGGTVVVVG DALLDVDLRG VSRRDCPDVP APVLEEPEPW YRPGGAALAA RRARQDGRDV 
VLVTAVGRDA AADELAALVG EGVRLVGLPL MEHTPTKTRV QANGRTVARL DQGCEGVELD
ARAEDVAEAL AGAAAVLVAD YGHGLTRQRA VRRALAACAR RGVPLVWDPH PRGADPVPGT
RLATPNAAEA GVAGGTGERA LRRAGELAGA WDVHSVAVTL GARGAAWSDA GGGCALLPGT
PVDAPTDTCG AGDAFAAACA TALADGDDVR DAVRRGVASA SAFVAGGGAS AYAAPSTAAG
RARAAAVPGP RAGEGGRAER VVATGGCFDV LHAGHVDLLR RARALGDRLV VLLNSDASVR
ALKGSGRPVV AEQDRARVLG ALDCVDEVVV FDEDTPVRAL EELRPDVWVK GGDYEVEDLP
ETPVVRRAGG GGGHRAPRAR PLHDRAVHPH PRPGPRRGTR PLRAPGPDAP APEPGKSETN
DGQRRTTTEN EGDRCVHSET R