Gene Ndas_4178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4178 
Symbol 
ID9248052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4988921 
End bp4990375 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content69% 
IMG OID 
ProductXanthine/uracil/vitamin C permease 
Protein accessionYP_003682079 
Protein GI297563105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.214839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAACTC CGAGTCCATC CCCCAGCGGT ACCGGCAAGA CGCCGTCGGG CCGCCTCGAC 
CGCTACTTCC GCGTCTCCGA GCGCGGATCC ACCTTCGGCA CCGAGGTGCG GGGCGGTCTG
GCGACCTTCT TCGCCATGGC CTACATCGTC GTCCTCAACC CGCTCATCAT CGGTACGGCC
GAGGACGTCA ACGGAGAGAC CCTGGGCATC CCCCAGGTGG CCGCCGTCAC CGCCCTGGTC
GCCGCGATCT CCTGCGTGCT CATGGGCGTG GTCAGCCGCT ACCCCTTCGC GATCGCGGCG
GGCATGGGAC TCAACGCCGT CGTCGCCTAC GGCATCGCGC CGGTCATGCC CTGGTCCGAC
GTCTTCCTGC TGATCATCAT CGAGGGCGTG CTGCTGCTGA TCCTGGTGCT CACCGGGTTC
CGCACCGCGG TCTTCGCCGC CATCCCGCCC GGCCTCAAGG TCGCCATCGC CGTCGGAGTG
GGCCTCTTCC TGGCCCTGGT CGGCCTGGTC AACGCCGGTT TCGTGCAGGC CGGTGAGGGC
ACCCCCGTCC AGCTCGGCAA CGGTGGCCTC CAGGGCTGGC CGATCCTGAT CTTCGTCATC
GGCCTGCTGA TCACCGTCGC GCTGTACGTG CGCAAGGTGC CGGGCTCGAT GCTCATCGGC
ATCATCGTCT CCACGGCCGT CGCCCTCCTC GTGGAGTCGC TGTTCGGCGG CGGCGACAAC
CGCCTCGGCT GGAGCCTGAC GGTCCCGACC CTGTCCACCG GCGGGGGAGT GGTGGCGGTC
CCCGACTTCT CGCTCGTCGG CATGTTCGCC GACGGCGGTC TCGACGTGTT CTCCCGCTGG
GCCGACATCG GCGTCGCCAC CGTGGTCATG CTGATCTTCA CGCTGCTGCT CGCCGACTTC
TTCGACACCA TGGGCACCAT GGTCGGCGTC GCCCACCAGG GCGACCTGGC CGACGAGGAC
GGCAACATCC GGGGCTCCCG CGAGGTGCTG GCCGCCGACT CCGTCGGCAC GATCCTGGGC
GGCCTGGGCT CGGCCTCGGT CGCCACCATC TACGCCGAGT CCGCCGCGGG CGTCGGCGAG
GGCGCCCGCA CCGGTATCGC GCCGATCGTC ACCGGCGTCC TGATGATGCT GGCCACCCTG
TTCACTCCGC TGGTGAACCT GGTGCCCTTC GAGGCCGCCA CGCCGGTGCT GGTGATCGTG
GGGTTCATGA TGATGGTCCA GGTCGTCAAC ATCGACTTCA GGGATCCGGC GCTGGGCCTG
GGGTCCTTCA TGGCGATCAT CATGATGCCC TTCACCTACT CGATCGCCAA CGGCATCGGG
TTCGGCCTGC TGACCTACGC CTTCGTCAGC CTGGTCACGG GCAAGGGACG CCAGGTCCAC
CCGCTGCTGT GGCTGATCTC GCTGGTCTTC CTGATCCACT TCGCCGAGGC GCCGATCAAC
GCCCTGATCG GCTGA
 
Protein sequence
MKTPSPSPSG TGKTPSGRLD RYFRVSERGS TFGTEVRGGL ATFFAMAYIV VLNPLIIGTA 
EDVNGETLGI PQVAAVTALV AAISCVLMGV VSRYPFAIAA GMGLNAVVAY GIAPVMPWSD
VFLLIIIEGV LLLILVLTGF RTAVFAAIPP GLKVAIAVGV GLFLALVGLV NAGFVQAGEG
TPVQLGNGGL QGWPILIFVI GLLITVALYV RKVPGSMLIG IIVSTAVALL VESLFGGGDN
RLGWSLTVPT LSTGGGVVAV PDFSLVGMFA DGGLDVFSRW ADIGVATVVM LIFTLLLADF
FDTMGTMVGV AHQGDLADED GNIRGSREVL AADSVGTILG GLGSASVATI YAESAAGVGE
GARTGIAPIV TGVLMMLATL FTPLVNLVPF EAATPVLVIV GFMMMVQVVN IDFRDPALGL
GSFMAIIMMP FTYSIANGIG FGLLTYAFVS LVTGKGRQVH PLLWLISLVF LIHFAEAPIN
ALIG