Gene Rcas_0423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0423 
Symbol 
ID5537885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp535938 
End bp538412 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content64% 
IMG OID640892585 
Productalpha beta-propellor repeat-containing integrin 
Protein accessionYP_001430572 
Protein GI156740443 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.657056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCATC CCGCCATGTT CAGTGCGCGT CTTGCCGGCG TCCTGGTGAT CGCGCTGATT 
GCGTTCTTGA CGCCCGTTCG TTTGGCGCTG CCGCAATCAT CCCTGAACTT CACGCAAACG
AAACTGACTG CCGCTGATGC CGCTCAGTAC GATTATTTCG GTCTGTCAGT CGCGCTTGCA
GGTGACACGG CAATCGTTGG CGCCTACGGC AAGTCGGACC TGGCGCCCAA CGCTGGCGCC
GCCTACGCTT TTGCCCGCAG CACTGCATCC TGGGTGCAGC AGGCGCGTCT TGGCATATCC
GATGTTCTGG CAGGCGCGTA CCTCGGCGCA GTAGTGGCGA CCGATGGCGT GCAGACAGCG
GTGGGAGCGC CGTATGCCGG CATTGATGCG CAGGATGCCG GCGCGGTTTA TCTCTTCTCG
AACGCTAGCT GGCAACGCCA GGCGATCATC ACTCCCGCTG ATCCTGAATC ACCGGCGCAG
TTTGGCGGCG CTATCGCAAT CGGTCAGAAC ACGCTGATCG TTGGTGCGCC GTTGCACGAC
TCCTACGGCA GGGATGCTGG CGCGGTCTAT GTCTTCGCCT TCGATGGCGT TGCGTGGGTG
CAACGCCAGA AACTGATCGG CGCCGATGTT GTTCCTGGCG ACCGCTTCGG CAGCGCACTG
GCGCTGAGCG ATGGCTGGCT TGCGGTCAGC GCGCCACTGC ATGGCGCGGG TGGGGCGGTC
TATCTCTTCG AATTCGATGG CGTTGCCTGG GTGCAGCGGC ACAAAGTGTC CGCTGGCGAC
ACGATTGCCG GAGATCGCTT CGGCAGCGCA CTGGCGTTGA ACAATGGCTG GCTTGCGGTT
GGGGCGCCAT TGCATCGCGT TACCGGCAGT TTCAGCGGAG CCGTCTATCT TTTTGAGTTT
AACGGCGCGT CCTGGCCGCA GCGCCAGAAG TTCGTAGCAA GCGATACGGT CGCCAGCGAT
CGCTTCGGTA GCGCACTGGC GCTTTCCGGT CAGCGACTCG TCGTCGGAGC GCCATTGCAT
AGTGCGAACG GACCTGCCAG TGGCGCAGTC TATGTGTTCG ACCGTAGCGG CGCAACCTGG
ATCGAACGCG CAAAACTGAT CGGCAGCGAC ACCGATAGCG GTGATCGCCT CGGTTGGTCG
GTTGACATCG ATGGCAATAC CATTATTGCC GGGGCGTATG GCGATGCGCT CTTTGGTCCG
GCAACCGGTG CAGCGTATGT TTTTGTCGAT GTGACCGGTG CAGGGGCAAC CAATACGCCG
TTACCGATCG TCACCGCCAC GCCAACGGCG ACGGCCACGC CAACGGCGAC GGCCACGCCA
ACGGCGACGG CCACGCCAAC GGCGACGGCC ACGCCAACGG CGACGGCGAC GGCCACGAAC
ACGGCCACGG CGACGCCAAC GGCGACGGCC ACTGCCACGA ACACGGCGAC GGCGACGGCC
ACACCAACGG CGAGCCACAC GGCGACGGCA ACGGCAACGG CGACGTCCAC TGCCACGAAC
ACGGCGACGC CAACGCCAAC GGCGACGCCA ACGGCAACGG CGACAGCCAC GAACACGGCA
ACGGCGACGG CCACTCCTGT GCCGACGGCC ACACCGACGG CCACGCCAAC GGCCACTCCT
GCGCCGACGG CCACACCGAC GGCCACACCG ACGGCGAGCC ACACGGCAAC GGCGACGTCC
ACTGCCACGA ACACGGCGAC GCCAACGGCG ACGCCAACGG CGACGGCCAC TGCCACGAAC
ACGGCCACGG CGACGGCCAC TGCCACGAAC ACGGCGACGG CGACGCCAAC GGCGACGGCG
ACGCCAACGG CGACGGCAAC GGCAACGGCG ACGGCGACGG CGACGCCCAC ACCAATGGCG
ACGGCCACAC CGTCACCGAC GGCAACCGTC ACGCCTGTAA TCCTGCGTCC GTTTCTCGCA
TGTGTTGCGC GGCGCGCACC GGCAGGTTAT GTCGCGCTCT TCGGCTACGA GGTGCAAGGT
GATGCGTCTG TGCAGGTTCC AATCGGCGCC GACAATCGCT TCAATCGCTA TCGAGAGAAC
CTTGGGCAGC CGACGACCTT CGAGCCAGGG AAGCGCAGAG TTGCATTTGC GGTGGTTTTC
GATGGTCTGC CGCTCACATG GTCGCTGAAT GGGCAGACAG TCACGGCTCA TGCGAATTAT
CCGATCCGCT GTGGCAGCGA TGCCGTGCTC CGTATTCAGC CCATTCTGGA ATGCACGCTG
CCCGATGGCA ATGGCGCTTC GATTGCGCGG TTCGGCTATC GGAACGATAA CGCCTTCAAT
GTTGCCGTGC CCGTTTGGTG GCAGAATTTC TTTGTTCCGC GACCGATCCA ACGCGGCCAG
CCAATCGTAT TTGCGCCCGG TCGTCATCGG AATGTTTTCT CAACCGGTTT CTCACAGGGG
GCGTTGGTGT GGCTCCTCGA TGGACGGATC GCGGTAGCGA CCGACTCGCC GGTGCAGGCG
TGCCGGTTCA ACTGA
 
Protein sequence
MRHPAMFSAR LAGVLVIALI AFLTPVRLAL PQSSLNFTQT KLTAADAAQY DYFGLSVALA 
GDTAIVGAYG KSDLAPNAGA AYAFARSTAS WVQQARLGIS DVLAGAYLGA VVATDGVQTA
VGAPYAGIDA QDAGAVYLFS NASWQRQAII TPADPESPAQ FGGAIAIGQN TLIVGAPLHD
SYGRDAGAVY VFAFDGVAWV QRQKLIGADV VPGDRFGSAL ALSDGWLAVS APLHGAGGAV
YLFEFDGVAW VQRHKVSAGD TIAGDRFGSA LALNNGWLAV GAPLHRVTGS FSGAVYLFEF
NGASWPQRQK FVASDTVASD RFGSALALSG QRLVVGAPLH SANGPASGAV YVFDRSGATW
IERAKLIGSD TDSGDRLGWS VDIDGNTIIA GAYGDALFGP ATGAAYVFVD VTGAGATNTP
LPIVTATPTA TATPTATATP TATATPTATA TPTATATATN TATATPTATA TATNTATATA
TPTASHTATA TATATSTATN TATPTPTATP TATATATNTA TATATPVPTA TPTATPTATP
APTATPTATP TASHTATATS TATNTATPTA TPTATATATN TATATATATN TATATPTATA
TPTATATATA TATATPTPMA TATPSPTATV TPVILRPFLA CVARRAPAGY VALFGYEVQG
DASVQVPIGA DNRFNRYREN LGQPTTFEPG KRRVAFAVVF DGLPLTWSLN GQTVTAHANY
PIRCGSDAVL RIQPILECTL PDGNGASIAR FGYRNDNAFN VAVPVWWQNF FVPRPIQRGQ
PIVFAPGRHR NVFSTGFSQG ALVWLLDGRI AVATDSPVQA CRFN