Gene Rcas_4138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4138 
Symbol 
ID5541649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5356473 
End bp5358347 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content59% 
IMG OID640896249 
Productglycosyl transferase family protein 
Protein accessionYP_001434187 
Protein GI156744058 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0438] Glycosyltransferase
[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00102626 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTTCAAC CTGCAACCTG CAACCTGCAA CCTTCAACCT TCAGCGACCC GCAGCGCCCT 
CCGGTAACAA TTATCATCCT CACGTGGAAC GGGTTAGAGT ACACTCGTCG CTGTATCGAG
AGTATTCGCG CGCATACGAA GGGTATGGCG TATCACCTGT TGGTTGTGGA CAATGGGAGC
AGCGATGGGA CACTGGAGTG GCTGCGCGCG CAGGACGACA TCCGGGTAAT TGCGAATGAT
CGCAACCTGG GATTCACGCG CGGCAACAAT CAGGGCATGG CGGCGACTCC GCCGGATCAC
GATGTGCTAT TGCTCAACAA CGATACGCTG ATCATCCAGG ATTACTGGCT GGCGCACCTC
AGCAATGTGG CGCACAGTCA TCCAGAGTAT GGCATCGTCG GATGCACGCT GTTGCACGCC
AATGGACTGC TCCAGCATGC CGGAACGTAT ATGCCGGCAG ATAGTTTCTG GGGGTATCAG
ATCGGAGGCG GTGAGACGTA CATTGGGCAG TATCCGGGTG TGCGCGAAGT CGAAGGGATC
ACCGGCGCAT GCATGTACAT CCGACGCGAT GTGCGCGCGC GGATCGGCGG CTTCGACGAG
ACGTACACGT CGTACTTTGA GGATACCGAT TACTGTCTGC GAGCGCGCCA GGCGGGGTTC
AAAGTCGTTT GCACCGGCGG TACGCAGGTG ATCCACTACG AGAATACGAG CGCCAGGATC
AACAATGCGT CGTGGCAGGC GATGTGGGAC GAGGGACGCG AGATGTTTAC CCGCAAATGG
CGCACGTTTT ACAACCAGAA ATATCGTCGC GCTGTCGTCT GGCACTCGCT GGTGGCATCG
CCATCCGGGT ACGCCACCTC GTCGCGTGAA CTGGTGATCG AACTCGACCG CTGCGCTATC
GATGTGCGCC TGGCGTGCAT CTGGGGGAAT GATTTCACCG AGCCGCTGAC CGGCGATCCG
CGCATCGATC AGTTGCGCGC CCGTCTTAAG GACTCTCGTC TGCCCCAGGT GGTGTATCAT
CAGGGTGACT CTTTCATCAA GAATAGTGGA CGCTATCGCA TCGGCTATAC GATGCTGGAA
ACCGACCGGT TGCCGGATGA GTGGGTCTAC CAGGCGAACC AGATGGATGA AGTCTGGACG
CCAACGCACT GGGGGGCTGA GGTCTTTTGC GCCAGCGGCG TCCGGCGTCC GATCTCTGTC
GTTCCACTGG GGATCAACCC CGATTATTTT CACCCTGGCA TCACCGGACA TAAACCCGGC
AATCGCTTTG TTTTTCTCTC GATCTTCGAG TGGATCGAAC GCAAAGCGCC GGAACTGCTG
ATCCGCGCCT ATCAGCAAAC GTTTCGCCGC AGCGATGATG TGGTACTGCT GCTCAAAATC
TTCAACCACG ACCCCAGTCT TGATGTCGCC CGACGTATTG GCGACCTGAT CCGCAGCGAT
GGTCCGCCGA TTGTCGTTCT GCCGAATCAG CACGTTGCCG CCTATCAGGT TGGGTGTCTG
TACCGCAGCG CCGATTGTTT CGTGCTGCCG ACGCGCGGTG AGGGCTGGGG CATGCCTGCG
CTGGAGGCAA TGGCATGTGG TCTGCCGGTT ATTTCGACCG CTTGGGGCGG GCAGACGGAG
TTCCTCCATT CAGGTGTCGC CTATCCGCTT CGCATTCGTG GTCTTGTCCC GGCGGAAGCG
CGCGCGCCGT ACTACCGCGG GTTGCGCTGG GCTGACCCCG ATTTCGATCA TCTCTGTGCG
TTGATGCGCC ACGTGTATGA GCATCCCGAC GAAGCGCGCG CAGTCGGGAT GCGCGCTGCT
GCGGAAGCTG CCGCGCGCTG GACGTGGTCG CACGCCGCAG CGAAGATTAT CGAGCGCCTG
GAAGCGATTG AGTGA
 
Protein sequence
MVQPATCNLQ PSTFSDPQRP PVTIIILTWN GLEYTRRCIE SIRAHTKGMA YHLLVVDNGS 
SDGTLEWLRA QDDIRVIAND RNLGFTRGNN QGMAATPPDH DVLLLNNDTL IIQDYWLAHL
SNVAHSHPEY GIVGCTLLHA NGLLQHAGTY MPADSFWGYQ IGGGETYIGQ YPGVREVEGI
TGACMYIRRD VRARIGGFDE TYTSYFEDTD YCLRARQAGF KVVCTGGTQV IHYENTSARI
NNASWQAMWD EGREMFTRKW RTFYNQKYRR AVVWHSLVAS PSGYATSSRE LVIELDRCAI
DVRLACIWGN DFTEPLTGDP RIDQLRARLK DSRLPQVVYH QGDSFIKNSG RYRIGYTMLE
TDRLPDEWVY QANQMDEVWT PTHWGAEVFC ASGVRRPISV VPLGINPDYF HPGITGHKPG
NRFVFLSIFE WIERKAPELL IRAYQQTFRR SDDVVLLLKI FNHDPSLDVA RRIGDLIRSD
GPPIVVLPNQ HVAAYQVGCL YRSADCFVLP TRGEGWGMPA LEAMACGLPV ISTAWGGQTE
FLHSGVAYPL RIRGLVPAEA RAPYYRGLRW ADPDFDHLCA LMRHVYEHPD EARAVGMRAA
AEAAARWTWS HAAAKIIERL EAIE