Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0423 |
Symbol | |
ID | 5537885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 535938 |
End bp | 538412 |
Gene Length | 2475 bp |
Protein Length | 824 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640892585 |
Product | alpha beta-propellor repeat-containing integrin |
Protein accession | YP_001430572 |
Protein GI | 156740443 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.657056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCATC CCGCCATGTT CAGTGCGCGT CTTGCCGGCG TCCTGGTGAT CGCGCTGATT GCGTTCTTGA CGCCCGTTCG TTTGGCGCTG CCGCAATCAT CCCTGAACTT CACGCAAACG AAACTGACTG CCGCTGATGC CGCTCAGTAC GATTATTTCG GTCTGTCAGT CGCGCTTGCA GGTGACACGG CAATCGTTGG CGCCTACGGC AAGTCGGACC TGGCGCCCAA CGCTGGCGCC GCCTACGCTT TTGCCCGCAG CACTGCATCC TGGGTGCAGC AGGCGCGTCT TGGCATATCC GATGTTCTGG CAGGCGCGTA CCTCGGCGCA GTAGTGGCGA CCGATGGCGT GCAGACAGCG GTGGGAGCGC CGTATGCCGG CATTGATGCG CAGGATGCCG GCGCGGTTTA TCTCTTCTCG AACGCTAGCT GGCAACGCCA GGCGATCATC ACTCCCGCTG ATCCTGAATC ACCGGCGCAG TTTGGCGGCG CTATCGCAAT CGGTCAGAAC ACGCTGATCG TTGGTGCGCC GTTGCACGAC TCCTACGGCA GGGATGCTGG CGCGGTCTAT GTCTTCGCCT TCGATGGCGT TGCGTGGGTG CAACGCCAGA AACTGATCGG CGCCGATGTT GTTCCTGGCG ACCGCTTCGG CAGCGCACTG GCGCTGAGCG ATGGCTGGCT TGCGGTCAGC GCGCCACTGC ATGGCGCGGG TGGGGCGGTC TATCTCTTCG AATTCGATGG CGTTGCCTGG GTGCAGCGGC ACAAAGTGTC CGCTGGCGAC ACGATTGCCG GAGATCGCTT CGGCAGCGCA CTGGCGTTGA ACAATGGCTG GCTTGCGGTT GGGGCGCCAT TGCATCGCGT TACCGGCAGT TTCAGCGGAG CCGTCTATCT TTTTGAGTTT AACGGCGCGT CCTGGCCGCA GCGCCAGAAG TTCGTAGCAA GCGATACGGT CGCCAGCGAT CGCTTCGGTA GCGCACTGGC GCTTTCCGGT CAGCGACTCG TCGTCGGAGC GCCATTGCAT AGTGCGAACG GACCTGCCAG TGGCGCAGTC TATGTGTTCG ACCGTAGCGG CGCAACCTGG ATCGAACGCG CAAAACTGAT CGGCAGCGAC ACCGATAGCG GTGATCGCCT CGGTTGGTCG GTTGACATCG ATGGCAATAC CATTATTGCC GGGGCGTATG GCGATGCGCT CTTTGGTCCG GCAACCGGTG CAGCGTATGT TTTTGTCGAT GTGACCGGTG CAGGGGCAAC CAATACGCCG TTACCGATCG TCACCGCCAC GCCAACGGCG ACGGCCACGC CAACGGCGAC GGCCACGCCA ACGGCGACGG CCACGCCAAC GGCGACGGCC ACGCCAACGG CGACGGCGAC GGCCACGAAC ACGGCCACGG CGACGCCAAC GGCGACGGCC ACTGCCACGA ACACGGCGAC GGCGACGGCC ACACCAACGG CGAGCCACAC GGCGACGGCA ACGGCAACGG CGACGTCCAC TGCCACGAAC ACGGCGACGC CAACGCCAAC GGCGACGCCA ACGGCAACGG CGACAGCCAC GAACACGGCA ACGGCGACGG CCACTCCTGT GCCGACGGCC ACACCGACGG CCACGCCAAC GGCCACTCCT GCGCCGACGG CCACACCGAC GGCCACACCG ACGGCGAGCC ACACGGCAAC GGCGACGTCC ACTGCCACGA ACACGGCGAC GCCAACGGCG ACGCCAACGG CGACGGCCAC TGCCACGAAC ACGGCCACGG CGACGGCCAC TGCCACGAAC ACGGCGACGG CGACGCCAAC GGCGACGGCG ACGCCAACGG CGACGGCAAC GGCAACGGCG ACGGCGACGG CGACGCCCAC ACCAATGGCG ACGGCCACAC CGTCACCGAC GGCAACCGTC ACGCCTGTAA TCCTGCGTCC GTTTCTCGCA TGTGTTGCGC GGCGCGCACC GGCAGGTTAT GTCGCGCTCT TCGGCTACGA GGTGCAAGGT GATGCGTCTG TGCAGGTTCC AATCGGCGCC GACAATCGCT TCAATCGCTA TCGAGAGAAC CTTGGGCAGC CGACGACCTT CGAGCCAGGG AAGCGCAGAG TTGCATTTGC GGTGGTTTTC GATGGTCTGC CGCTCACATG GTCGCTGAAT GGGCAGACAG TCACGGCTCA TGCGAATTAT CCGATCCGCT GTGGCAGCGA TGCCGTGCTC CGTATTCAGC CCATTCTGGA ATGCACGCTG CCCGATGGCA ATGGCGCTTC GATTGCGCGG TTCGGCTATC GGAACGATAA CGCCTTCAAT GTTGCCGTGC CCGTTTGGTG GCAGAATTTC TTTGTTCCGC GACCGATCCA ACGCGGCCAG CCAATCGTAT TTGCGCCCGG TCGTCATCGG AATGTTTTCT CAACCGGTTT CTCACAGGGG GCGTTGGTGT GGCTCCTCGA TGGACGGATC GCGGTAGCGA CCGACTCGCC GGTGCAGGCG TGCCGGTTCA ACTGA
|
Protein sequence | MRHPAMFSAR LAGVLVIALI AFLTPVRLAL PQSSLNFTQT KLTAADAAQY DYFGLSVALA GDTAIVGAYG KSDLAPNAGA AYAFARSTAS WVQQARLGIS DVLAGAYLGA VVATDGVQTA VGAPYAGIDA QDAGAVYLFS NASWQRQAII TPADPESPAQ FGGAIAIGQN TLIVGAPLHD SYGRDAGAVY VFAFDGVAWV QRQKLIGADV VPGDRFGSAL ALSDGWLAVS APLHGAGGAV YLFEFDGVAW VQRHKVSAGD TIAGDRFGSA LALNNGWLAV GAPLHRVTGS FSGAVYLFEF NGASWPQRQK FVASDTVASD RFGSALALSG QRLVVGAPLH SANGPASGAV YVFDRSGATW IERAKLIGSD TDSGDRLGWS VDIDGNTIIA GAYGDALFGP ATGAAYVFVD VTGAGATNTP LPIVTATPTA TATPTATATP TATATPTATA TPTATATATN TATATPTATA TATNTATATA TPTASHTATA TATATSTATN TATPTPTATP TATATATNTA TATATPVPTA TPTATPTATP APTATPTATP TASHTATATS TATNTATPTA TPTATATATN TATATATATN TATATPTATA TPTATATATA TATATPTPMA TATPSPTATV TPVILRPFLA CVARRAPAGY VALFGYEVQG DASVQVPIGA DNRFNRYREN LGQPTTFEPG KRRVAFAVVF DGLPLTWSLN GQTVTAHANY PIRCGSDAVL RIQPILECTL PDGNGASIAR FGYRNDNAFN VAVPVWWQNF FVPRPIQRGQ PIVFAPGRHR NVFSTGFSQG ALVWLLDGRI AVATDSPVQA CRFN
|
| |