Gene Rcas_4084 details


Gene Information

Locus tag: Rcas_4084
Symbol:
ID: 5541595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5295697
End bp: 5296692
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 53%
IMG OID: 640896196
Product: nucleotidyl transferase
Protein accession: YP_001434134
Protein GI: 156744005
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1209] dTDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
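
The start and end positions, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5295697
end_bp = 5296692

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 996
print(protein_length_aa)  # 331
```

Both results agree with the listed Gene Length (996 bp) and Protein Length (331 aa).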


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.313929
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTCA TTATCCCTAC TGCCGGGCTT GGGACGCGCC TGCGCCCACA TACGTACAGT 
AAACCCAAGC CGCTGGTGTC GGTTGCCGGA AAGCCGGTCC TCGGTCATAT TCTCGATACG
CTGACCCGAT TTCCAATCGA CGAGATGATC TTCATCACCG GCTATCTGGG CAATCAAATT
GCCGATTATG TCACATCAAA TTATAAAATC CCGGCGCGCT TCATCGAACA AACGGAACTG
AAAGGTCAGG CGCATGCCGT CTATCTGGCG CGTGAAGTTG TCGATGGTCC CACGCTCATC
CTGTTCGTGG ATACGATTTT CGAGGCAGAC CTGAGTTGTC TGACCGAACA GGATATCGAT
GGCGCCATTT TCTGCAAGGA GGTGGACGAT CCGCGGCGGT TCGGCGTGGC GTTTACCAAA
GATGGATTCA TCACCCGGCT CGTGGAAAAA CCGGTGACCG ATGAGTCGAA ACTGGCGATG
ATCGGGCTGT ATTACATCCG CGACATTCAG TGGTTGATGC GCGCAATCGA AGTGCTCATG
CTGCGCAATA TTCAAACGAA AGGCGAGTAC TTCCTGACTG ATGCCCTGCA ATTGATGGTC
GAGAATGGCG CCCGTTTCAC TGCGCCGACG GTCGATGTCT GGGAAGACTG CGGGAAACCA
GAGACGGTAT TGCAAACGAA TCGGTATCTG CTCGATCACG GTCGGGATCA TGTCGATGCT
TCTCGATTGG ATGGTTCGAT CATTATTCCG CCTGTCTATA TCGACGATAC GGCGCGCGTG
ATCAACTCGA TCATCGGTCC GTATGTGTCG ATAGCGGCGG GGGCGATTGT CAAAGACTCG
ATCATTCGCG ATTCGATCAT CAATCGCGAT GCGCAGATCG TCTCCGCTAC ACTGCAATCG
AGTCTGATTG GTGATCACGC AGTGGTTCTG GGCGATTTTC GCGAGCTCAA TGTGGGAGAT
TCTTCCGAAA TCCGGTATGG ACGCCCTGCA CATTGA
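
The gene sequence is printed in blocks of ten bases, so spaces and line breaks have to be removed before any computation such as GC content. The following is a minimal Python sketch of that cleanup and of a GC-fraction calculation; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and only the first row of the sequence is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence only; paste all rows to get the whole-gene value.
first_row = "ATGAAAGTCA TTATCCCTAC TGCCGGGCTT GGGACGCGCC TGCGCCCACA TACGTACAGT"
seq = clean_sequence(first_row)

print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC content of the first row only
```

Run on the complete sequence, the result can be compared with the GC content reported in the Gene Information section.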
 
Protein sequence
MKVIIPTAGL GTRLRPHTYS KPKPLVSVAG KPVLGHILDT LTRFPIDEMI FITGYLGNQI 
ADYVTSNYKI PARFIEQTEL KGQAHAVYLA REVVDGPTLI LFVDTIFEAD LSCLTEQDID
GAIFCKEVDD PRRFGVAFTK DGFITRLVEK PVTDESKLAM IGLYYIRDIQ WLMRAIEVLM
LRNIQTKGEY FLTDALQLMV ENGARFTAPT VDVWEDCGKP ETVLQTNRYL LDHGRDHVDA
SRLDGSIIIP PVYIDDTARV INSIIGPYVS IAAGAIVKDS IIRDSIINRD AQIVSATLQS
SLIGDHAVVL GDFRELNVGD SSEIRYGRPA H
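
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal Python sketch of such a translation, assuming that table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted) and that the input contains only A, C, G, and T. The names `CODON_TABLE` and `translate` are illustrative, and start-codon handling is not modelled.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAGTCA TTATCCCTAC TGCCGGGCTT"))  # MKVIIPTAGL
```

The output matches the first ten residues of the protein sequence; translating the full gene sequence should reproduce all 331 residues.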