Gene Rcas_3640 details


Gene Information

Locus tag: Rcas_3640
Symbol:
ID: 5541142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4760153
End bp: 4762249
Gene length: 2097 bp
Protein length: 698 aa
Translation table: 11
GC content: 51%
IMG OID: 640895760
Product: glycosyl transferase group 1
Protein accession: YP_001433707
Protein GI: 156743578
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
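
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(4760153, 4762249)
print(length_bp)                  # 2097
print(protein_length(length_bp))  # 698
```

Both values agree with the gene length (2097 bp) and protein length (698 aa) reported in the record.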


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000440881
Fosmid hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACGCA TTTGCTTTGT CACGCACGAG ATTGCTCCAA CTACTTGGGG TGGTTGCGGG 
GTATTGTTAT ACAACGCGGC CTGTATGCTC TTGAGAAAAG GTCACGAAGT TATTTTCGTG
TTGGACTTGC CGCATGATTA CTTTGATAGA TTTCAAGGGG TTGACCGTTT GCGCTTTCCC
AACCCGGAGA AATGTCGCGC GTATCACGCT GAGAGTATTG TTGCCCGGAG TGATCTTAAT
GAAAACTCTT TTCCTTTTCC ATCTCTCTGG AAAGCGTATC GAATTCATCT GGCGTGCAAA
TATGTGGCTG AAAGGGAAAA ACCCGATCTC ATCGAGTTTC ATGATTTTCT AGGGGTAGGG
CACTATGCTC TGACGGCAAA AGCGGCGGGT CTGTGTTATC AGGCGACACA CCTGGCGGTT
CGTTTGCACA ACTCCATCGA GGTTATGGAT ATTCACTCTT CCTCTGCGCA CCTACATGCT
CATAACCACT TTGTGCATGA TCTTGAGCGT AGCGCTCTGC AGCTTGCCGA AACGATCCTC
TATCCGTCAC CATCATACTT GCGCGAGGCA TATCAACCGT TCTACCCTCT CTGGTTTGGG
AGGGTTGTGG AGTCTCAATC GCCGCTTGTT GATGTTCCGT CGAAAAACAA TTACAGCGAT
GAGGATCATG TTATTCTGTT CTATGGACGG ATTTTTTCGA TGAAGGGTGT TGATGTATTT
GTTGATGCTG CTGTTGAAAT GTTGCGCAGA TACAGCGAAG TGCGTTTTGT GCTTGCAGGG
TATGACTCTC GGGAGGCGCC CGATGGAAGT CTTACCTATG AGCAATTCTT AAGGCGCAAA
ATTCCCTCGC GCTTTCAGTC GGCCTTCGAA TTCGTCGGAC AGCTTGACCG TTTACAAGTA
GAATCTTTGC TCCCGCGTGT TCGATTTGCA GTGTTTCCAA ACCACTACGA GTCGTTCTGT
TATGCTGCCC ACGAACTTTA TGCGGCAGGC ATTCCGGTCA TTGTTTCGAA CATCCCTGGA
TTCAGGGATG TTTTCAGGCA TGAGGAGAAT GCCCTGGTTT TTGATGGTAC CGTAGAAGAT
CTGGCAAGGC AGATGATGCG TCTTTGGAAT GACCATGCAT TGCGCCAGCG TCTCATATTT
CCCTATCCTG TCGCAACTCG ACCTCTTGGT GATATCTATG ACAATCCCCC CCGCGACAGT
TGGATTGTGA GAGGTGATCA TGCCTCATGT TCTTTGCTCG TGTGTGTGAT CGGTGAAGAA
GGCGCCCTGT TTCAAGAGAC CATTGGGAGT CTAGAACGAG TTCATAAAAG TGATATGCGA
ATTGTTCACT TGCGACCGGC AGGCTCGACA AAAGATGAAG CCTGTGGGTG GGTTCTTGGT
CAATTGTTTC AGTTCCTTTC TCTGGAAGGT GATGCTCTTC TTCCGACCGA GGTTCGTACC
GGACAGGCGC TGTTACTGTT GCGTGCCGGG GATCGTGTCG CGCCAGATTT TATTCGTGTC
GCCTGTAATA CGCTTGCTCG TCAGCCTCAG ATCGGTTTTG TCGGCGCCTG GCATCGTGTT
CGCGCGGCTG AAAAGGAATG TATCGAGAAC TTTCCATTTG ACGCATCCCC GGAGTTGCTG
CCTTTTCTTA GCGGTAAGAT GCTGCATCGT TTTGTTGTGC GCACCTTGCC TGATCGCATG
CTGATTGACG TGTTTGACTC AAAAGCGGGC GCCTGCGGTG AGATTGCGTA TGTCTGGAGT
CTTGATGCGC AGGGTCAGCG AGGGCTAATA ATTCCAGAAC CGCTGGTCGA ACGCGGTGAA
GAACCGCATG CTGTTCCGCG CCTGAATGAA CTCAGCGCAC TGATATTACG CGATACATCG
TACTGGCATC AGTCGCGTCT GGCGCGGCTT CTTGTCTGGT ATGTCGGTTA TGCCAGGGCG
CATCGTTTTC CACATCCGAG TCTGCACGAG GCGCCTGTGG TTGAGCCTCG CCTGTGGCGC
CTGGTTCGCC GGGTGTATAC AATGCCGGGC TATGCCGTAG CGCGCAATGC TGCGTTTGCC
TTCTACCGCC TCTGGAGGAG TCTGGATCGA GGTTATGGAA AGACGAACGC CCTATAA
 
Protein sequence
MKRICFVTHE IAPTTWGGCG VLLYNAACML LRKGHEVIFV LDLPHDYFDR FQGVDRLRFP 
NPEKCRAYHA ESIVARSDLN ENSFPFPSLW KAYRIHLACK YVAEREKPDL IEFHDFLGVG
HYALTAKAAG LCYQATHLAV RLHNSIEVMD IHSSSAHLHA HNHFVHDLER SALQLAETIL
YPSPSYLREA YQPFYPLWFG RVVESQSPLV DVPSKNNYSD EDHVILFYGR IFSMKGVDVF
VDAAVEMLRR YSEVRFVLAG YDSREAPDGS LTYEQFLRRK IPSRFQSAFE FVGQLDRLQV
ESLLPRVRFA VFPNHYESFC YAAHELYAAG IPVIVSNIPG FRDVFRHEEN ALVFDGTVED
LARQMMRLWN DHALRQRLIF PYPVATRPLG DIYDNPPRDS WIVRGDHASC SLLVCVIGEE
GALFQETIGS LERVHKSDMR IVHLRPAGST KDEACGWVLG QLFQFLSLEG DALLPTEVRT
GQALLLLRAG DRVAPDFIRV ACNTLARQPQ IGFVGAWHRV RAAEKECIEN FPFDASPELL
PFLSGKMLHR FVVRTLPDRM LIDVFDSKAG ACGEIAYVWS LDAQGQRGLI IPEPLVERGE
EPHAVPRLNE LSALILRDTS YWHQSRLARL LVWYVGYARA HRFPHPSLHE APVVEPRLWR
LVRRVYTMPG YAVARNAAFA FYRLWRSLDR GYGKTNAL
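
The gene and protein sequences above are printed in blocks of ten characters, so any script that reads them has to strip the whitespace first. The following is a minimal Python sketch, not the database's own method, of how one might translate the gene sequence and estimate its GC content. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard code for internal codons. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene begins with ATG. All names in the block are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGAAACGCA TTTGCTTTGT"))  # MKRICF
```

The printed residues match the first six residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would let a reader compare the results against the listed protein sequence and the reported GC content.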