Gene Rcas_4137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4137 
Symbol 
ID5541648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5355178 
End bp5356473 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content62% 
IMG OID640896248 
Productglycosyl transferase group 1 
Protein accessionYP_001434186 
Protein GI156744057 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000953773 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCACATCG GCGTTGACAT TTCGCTCCTG CGAATTGCTC AGGCAGGCGT GCTGACCTAT 
CATCGGTCGC TGCTCGACCA TCTGGTGCGA GCCGGGCGTG ATTGCCATTT TACGCTGATC
GATGTGCTGC CGCTCAACCC TGGCCGTTCG ATGCTGTGGC TGGCAGCCCT CGACTCGCCG
AATGTGCGGG TCGTGCGTTG CCCCGGCGTT CGGCGTGGCT ACCTGAGCGC GCTTCCTGCG
TTTCGTGATG GAGTGGCGCA TCCGATTGCG GCGCGGATCG ACCGCATCCT CGATCCAATC
TGGTCGCAAC TGGCGGTCGC CGAGATGGGG CTGGAACTCA GGGGCGCCAC GAGGTTCGTT
GAGGTCTTTC ATGCTTCCGA TCAATTGCCT TATGCACCGC CTGGCGCTGC AACCGTGCTG
ACCATTCACG ATCTGACCAC CCGGCGCTTC CCCGATATGC ATGTGGCGGA GAACGTTGCG
TTGCATGCAG CCAAGGAGCG ATTTGCCCGC GACCGCGCCG ACCGGATTAT TGCCGTGTCG
GAAGCGACCC GCCGTGATAT TGTGTGCGAA CTGGGCATTC CGCCTGAGCG GATCAGTGTG
GTGTATGAAG CCGCAGATGT GCGCTTTCGT CCACGCGCGC CGGATGAAAC GTGTTCGGTT
CTGGCGCGAT ACGATATAGC GCATGGCGCG TATGTGCTGA GTGTTGGCAC GCTCGAGCCG
CGCAAGAACT ACATCCGTTT GATCGAAGCG TATGCCGTGT TGTGTGCGCG GTATGCCGCA
GATGCGCGTC GTTTGCCGCC GTTGATTATT GCCGGCGGCT ACGGCTGGAA GCACGACGCG
ATCCTCGCCG CGCCGGAACG CGCCGGAGTT GCCGGACAGG TCCGATTTAT CGGGCGCATT
CCCGATGACG ACCTGCCTGC ACTGGTTGCC GGGGCGCGCC TGTTTGTGTA TCCTTCACTG
TACGAAGGGT TTGGTCTTCC GCCGCTGGAG GCGCTTGCTT CAGGAACGCC GGTGGTGGTG
GCAAACACCT CATCGCTGCC GGAGGTGGTT GGCGATGCCG GATTGTACTG CGATCCGTAT
CAGGTGAGCG ATATTGCGCG CCAGATCGCT GCACTCCTGG ACAGCGAGGA CCTGGCGTTG
CGGTTACGGC ACGCTGGAAT TGAACGCGCA AAGCAGTTCT CGTGGGAGCG CGCTGCCCGT
GAAACGCTTG CAGTCTATGC GCAGGCGCGC GCTGAACGCT GTGCGAGGCG ACGATGGCAA
TTTACTGGGT TCATTCCGAC AGCGATGACG CGCTGA
 
Protein sequence
MHIGVDISLL RIAQAGVLTY HRSLLDHLVR AGRDCHFTLI DVLPLNPGRS MLWLAALDSP 
NVRVVRCPGV RRGYLSALPA FRDGVAHPIA ARIDRILDPI WSQLAVAEMG LELRGATRFV
EVFHASDQLP YAPPGAATVL TIHDLTTRRF PDMHVAENVA LHAAKERFAR DRADRIIAVS
EATRRDIVCE LGIPPERISV VYEAADVRFR PRAPDETCSV LARYDIAHGA YVLSVGTLEP
RKNYIRLIEA YAVLCARYAA DARRLPPLII AGGYGWKHDA ILAAPERAGV AGQVRFIGRI
PDDDLPALVA GARLFVYPSL YEGFGLPPLE ALASGTPVVV ANTSSLPEVV GDAGLYCDPY
QVSDIARQIA ALLDSEDLAL RLRHAGIERA KQFSWERAAR ETLAVYAQAR AERCARRRWQ
FTGFIPTAMT R