Gene Information |
Locus tag | Rcas_3567 |
Symbol | |
ID | 5541068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4657124 |
End bp | 4659151 |
Gene Length | 2028 bp |
Protein Length | 675 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640895686 |
Product | glycoside hydrolase clan GH-D |
Protein accession | YP_001433634 |
Protein GI | 156743505 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
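The coordinate and length fields above agree with one another, and the check is simple arithmetic. The following is a minimal Python sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the record states neither explicitly). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(4657124, 4659151)
print(length)                  # 2028
print(protein_length(length))  # 675
```

Both results match the Gene Length (2028 bp) and Protein Length (675 aa) fields.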
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.879137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.341251 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACT CTGGATCAGG CGCCACGCCT GAGACATGGA CGCTCTCGAC AGACACCGTA ACCATCACCG TGTCGCCGGC GGACGGGCGC CTGACCTGGA ATCTCAGCGG CGCAACTGGA GGGCTTGACC TGTCATCGGT GACGGGGAGC GCTTTGCGTG TCGATGGACG GATGCCGCTC TGGACGCAGA TTGTCGCTGT CGATGAGCAG TCCGTCGGAG ATGGCAGTCG CCTGTTGGCG CTAACGCTCG ATACGGCGGA TGGCGGGCTG CGATTGATGC GCTGGTTTCA CCTGTTCGCC GATCATCCGT TTGTGCGCAC CTGGGCGACG CTGGAGAATC GCAGCAGTGC GACGGTGCGG ATCGATCAGT GCGATATTCT GACGCTGACG CCACACGCCC CGCCGCCACT CTATCTGTTC CACGTCGAAC AGTTCAGTTG GAATTACCGG CGCGATTTCT TCAGTCAGCA CGAGGTCTGG CTGCGGGTGG GATGCGCTCC GCACGAGATC CGCATGGGAT CGCACCCCGC CCACCACTGG GGTCCCTCCA GTTGCGCGTG GTTCGCCCTG CGCGACGGTT CCCCCAACTG GAACGACGAG CCGCCGGAGG GGGGGCGCGG CATGGTGTGC GGCATCGAGT TCAACGGCAA GAGCCGGTTG CACGCCTGGG CAACCACCGA ACGGGTGAAT CTGGTGAGCC AGATCGATGA CCTGGCGCAC CGCCTCGCAC CAGGCGCGAT CTTCGAGATT CCGGCGTTTT TCGTCGGGCG CTTCGAGGGG GATTGGGACG AAGCCGGGTA TGTGACACAG CGTTTCGCCG AGGCGCATGT CCATCCCCCG ATGCCCGACG ACCGCTACCC GTGGGTGCAG TACAATTCCT GGCGGTACGA GCAGAACATC AACGAGGAGC AGCAACTGGC AGCCATCGAC CGCTGCGCGG AACTCGGCGT CGAACTTGTG GTGATGGACC TGGGGTGGGC ACGTATGATC GGCGACTGGC GCCCCGACCC GATCAAGTTT CCGCGCGGGC TGAAGCCGCT GGTCGAACGG GCGCACTCCT ACGGTATGCG GTTTGGCGTT CATGTTGCGC TGGCGCAGTG CAACCCGGAA GCGCCGGTCG CCAGAGCGCA CCCCGACTGG CTCATTCACA CCGGCAACGA TTACTATGGC GCCGGTCCGC TCTGCCTGGG ACACGAGCCG TGCCGCCAAT GGCTCATCGA GCAGTTGATC CGGCTGGTGG ACGAAGAGGG GATCGACTAC ATCATCCAGG ACGGCGAGGA TATGGTGAAG CGGTGCGAGC GCAGCGATCA CACGCATGCA CCGGGCGACA GCAACTACGC CAACTCACAG TATGGGCTGG ACATCGTCAT CGAATCGCTG CGCCGCGCCT GCCCACACCT GGTGCTCGAA AATTGCGAGG ACGGCGGATG CATGATGACC TATAAAATGG CGCGTCTGTA CCATACCAGC ATCACGGTGG ACAATACTTC GTCCTATGCC ACACGACAGG GGGTGTACGG CGCGTCTTAC CCGTTCTCGC CGCGCTATAG CGTGCGCTAC ATGCAGGACG ATCCCTCGCC CTACACCCTG CGCAGTTCAA TTTTCGGCGG GCCTCTCATC CTGATGCAGC GCGTCACCGA GTGGAACGAA GCGCAGATGG CGGAAACCAG GCGTGCTATC GAACAATATA AAGCATTGCG CCATCTCATC CGTTCGGCGA AGATCATCCA TCTGAAAGCG CCGCTGCACA ACATCGATGG TCTGGGATGG GGATGGGACG CTCTCCAGGC AGTGGCGCCC 
GACCAATCCC GCAGCGTCGT GATGGTATAC CGCGCGCAGG GAGATGTCGC CGAGCGCACC TTCAAGCCGC GCGGTCTGCT CCCAAACGCG GCGTATCTGG TCCGCTACGT CGATAGCGGG CGCACCCTGC AACGCACCGG CGCCGAACTG GAGCGCGACG GGATCACCGT GGCGCTAGAA GAGTTCAGTT CAGAGCTCGT TATGATAGAG GTGGAGGGTG AAAGGTAG
|
Protein sequence | MTHSGSGATP ETWTLSTDTV TITVSPADGR LTWNLSGATG GLDLSSVTGS ALRVDGRMPL WTQIVAVDEQ SVGDGSRLLA LTLDTADGGL RLMRWFHLFA DHPFVRTWAT LENRSSATVR IDQCDILTLT PHAPPPLYLF HVEQFSWNYR RDFFSQHEVW LRVGCAPHEI RMGSHPAHHW GPSSCAWFAL RDGSPNWNDE PPEGGRGMVC GIEFNGKSRL HAWATTERVN LVSQIDDLAH RLAPGAIFEI PAFFVGRFEG DWDEAGYVTQ RFAEAHVHPP MPDDRYPWVQ YNSWRYEQNI NEEQQLAAID RCAELGVELV VMDLGWARMI GDWRPDPIKF PRGLKPLVER AHSYGMRFGV HVALAQCNPE APVARAHPDW LIHTGNDYYG AGPLCLGHEP CRQWLIEQLI RLVDEEGIDY IIQDGEDMVK RCERSDHTHA PGDSNYANSQ YGLDIVIESL RRACPHLVLE NCEDGGCMMT YKMARLYHTS ITVDNTSSYA TRQGVYGASY PFSPRYSVRY MQDDPSPYTL RSSIFGGPLI LMQRVTEWNE AQMAETRRAI EQYKALRHLI RSAKIIHLKA PLHNIDGLGW GWDALQAVAP DQSRSVVMVY RAQGDVAERT FKPRGLLPNA AYLVRYVDSG RTLQRTGAEL ERDGITVALE EFSSELVMIE VEGER
|
| |
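The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any downstream use. The sketch below is one way to do that in plain Python. It strips the whitespace, computes the GC fraction, and translates with the amino acid assignments of translation table 11. The helper names are hypothetical, alternative start codons are not handled, and the example uses only the first 30 bases of the record rather than the full sequence.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field.
fragment = clean_sequence("ATGACCCACT CTGGATCAGG CGCCACGCCT")
print(translate(fragment))  # MTHSGSGATP
```

The translated fragment matches the first ten residues of the Protein sequence field. Running the same functions on the full cleaned gene sequence should give a GC fraction comparable to the reported GC content and a translation comparable to the full protein.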