Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4138 |
Symbol | |
ID | 5541649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5356473 |
End bp | 5358347 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640896249 |
Product | glycosyl transferase family protein |
Protein accession | YP_001434187 |
Protein GI | 156744058 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00102626 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTCAAC CTGCAACCTG CAACCTGCAA CCTTCAACCT TCAGCGACCC GCAGCGCCCT CCGGTAACAA TTATCATCCT CACGTGGAAC GGGTTAGAGT ACACTCGTCG CTGTATCGAG AGTATTCGCG CGCATACGAA GGGTATGGCG TATCACCTGT TGGTTGTGGA CAATGGGAGC AGCGATGGGA CACTGGAGTG GCTGCGCGCG CAGGACGACA TCCGGGTAAT TGCGAATGAT CGCAACCTGG GATTCACGCG CGGCAACAAT CAGGGCATGG CGGCGACTCC GCCGGATCAC GATGTGCTAT TGCTCAACAA CGATACGCTG ATCATCCAGG ATTACTGGCT GGCGCACCTC AGCAATGTGG CGCACAGTCA TCCAGAGTAT GGCATCGTCG GATGCACGCT GTTGCACGCC AATGGACTGC TCCAGCATGC CGGAACGTAT ATGCCGGCAG ATAGTTTCTG GGGGTATCAG ATCGGAGGCG GTGAGACGTA CATTGGGCAG TATCCGGGTG TGCGCGAAGT CGAAGGGATC ACCGGCGCAT GCATGTACAT CCGACGCGAT GTGCGCGCGC GGATCGGCGG CTTCGACGAG ACGTACACGT CGTACTTTGA GGATACCGAT TACTGTCTGC GAGCGCGCCA GGCGGGGTTC AAAGTCGTTT GCACCGGCGG TACGCAGGTG ATCCACTACG AGAATACGAG CGCCAGGATC AACAATGCGT CGTGGCAGGC GATGTGGGAC GAGGGACGCG AGATGTTTAC CCGCAAATGG CGCACGTTTT ACAACCAGAA ATATCGTCGC GCTGTCGTCT GGCACTCGCT GGTGGCATCG CCATCCGGGT ACGCCACCTC GTCGCGTGAA CTGGTGATCG AACTCGACCG CTGCGCTATC GATGTGCGCC TGGCGTGCAT CTGGGGGAAT GATTTCACCG AGCCGCTGAC CGGCGATCCG CGCATCGATC AGTTGCGCGC CCGTCTTAAG GACTCTCGTC TGCCCCAGGT GGTGTATCAT CAGGGTGACT CTTTCATCAA GAATAGTGGA CGCTATCGCA TCGGCTATAC GATGCTGGAA ACCGACCGGT TGCCGGATGA GTGGGTCTAC CAGGCGAACC AGATGGATGA AGTCTGGACG CCAACGCACT GGGGGGCTGA GGTCTTTTGC GCCAGCGGCG TCCGGCGTCC GATCTCTGTC GTTCCACTGG GGATCAACCC CGATTATTTT CACCCTGGCA TCACCGGACA TAAACCCGGC AATCGCTTTG TTTTTCTCTC GATCTTCGAG TGGATCGAAC GCAAAGCGCC GGAACTGCTG ATCCGCGCCT ATCAGCAAAC GTTTCGCCGC AGCGATGATG TGGTACTGCT GCTCAAAATC TTCAACCACG ACCCCAGTCT TGATGTCGCC CGACGTATTG GCGACCTGAT CCGCAGCGAT GGTCCGCCGA TTGTCGTTCT GCCGAATCAG CACGTTGCCG CCTATCAGGT TGGGTGTCTG TACCGCAGCG CCGATTGTTT CGTGCTGCCG ACGCGCGGTG AGGGCTGGGG CATGCCTGCG CTGGAGGCAA TGGCATGTGG TCTGCCGGTT ATTTCGACCG CTTGGGGCGG GCAGACGGAG TTCCTCCATT CAGGTGTCGC CTATCCGCTT CGCATTCGTG GTCTTGTCCC GGCGGAAGCG CGCGCGCCGT ACTACCGCGG GTTGCGCTGG GCTGACCCCG ATTTCGATCA TCTCTGTGCG TTGATGCGCC ACGTGTATGA GCATCCCGAC GAAGCGCGCG CAGTCGGGAT GCGCGCTGCT GCGGAAGCTG CCGCGCGCTG GACGTGGTCG CACGCCGCAG CGAAGATTAT CGAGCGCCTG GAAGCGATTG AGTGA
|
Protein sequence | MVQPATCNLQ PSTFSDPQRP PVTIIILTWN GLEYTRRCIE SIRAHTKGMA YHLLVVDNGS SDGTLEWLRA QDDIRVIAND RNLGFTRGNN QGMAATPPDH DVLLLNNDTL IIQDYWLAHL SNVAHSHPEY GIVGCTLLHA NGLLQHAGTY MPADSFWGYQ IGGGETYIGQ YPGVREVEGI TGACMYIRRD VRARIGGFDE TYTSYFEDTD YCLRARQAGF KVVCTGGTQV IHYENTSARI NNASWQAMWD EGREMFTRKW RTFYNQKYRR AVVWHSLVAS PSGYATSSRE LVIELDRCAI DVRLACIWGN DFTEPLTGDP RIDQLRARLK DSRLPQVVYH QGDSFIKNSG RYRIGYTMLE TDRLPDEWVY QANQMDEVWT PTHWGAEVFC ASGVRRPISV VPLGINPDYF HPGITGHKPG NRFVFLSIFE WIERKAPELL IRAYQQTFRR SDDVVLLLKI FNHDPSLDVA RRIGDLIRSD GPPIVVLPNQ HVAAYQVGCL YRSADCFVLP TRGEGWGMPA LEAMACGLPV ISTAWGGQTE FLHSGVAYPL RIRGLVPAEA RAPYYRGLRW ADPDFDHLCA LMRHVYEHPD EARAVGMRAA AEAAARWTWS HAAAKIIERL EAIE
|
| |