Gene Rcas_4300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4300 
Symbol 
ID5541811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5547415 
End bp5548524 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content58% 
IMG OID640896406 
Productphosphate binding protein 
Protein accessionYP_001434344 
Protein GI156744215 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR02136] phosphate binding protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCAGCA GGCTTTATCG CGGCACTGTG CTTTCAATTG CCCTGCTTCT CTCGGCGTGC 
ACCGGGGCCG CTGCACCATC TCCCACGCCC ACGCCGTCCC CTGCGCCGAC TGCCGCACCA
AGCGCCACTC CGCAGATCGT CTCTATTCAG CCAACCGCAA CGCCTTTTCC TGAGTCGATG
ACGCGCCTCG ATCTGAGTGG CGAGATTATT ATCGACGGCT CAAGCACCGT GTACCCGATC
ACCGAACTTG CGATGCAACA GTTCGCAGCG GTGGCGCCAC GGGTGACTAT TCAACTCGGA
GTCAGCGGCA CCGGCGGCGG GTTCAAGAAA TTCTGTGCAG GCATTACCGA CATCTCGAAC
GCTTCACGAC CGATTAAGCC GGACGAGAGC GACACCTGCC GCGCCAATGG CATCGCGTTC
GTCGAAATAC CGGTTGCTTT CGATGGCATC TCGGTGATCA TCAATCGAGA CAACACATGG
GCGCAGTGCA TGACCGTCGA TGATCTGAAA CGGATGTGGG CGCCCGAGTC GGAAGGAAGC
GTGACCACAT GGCGGCAAAT ACGCTCCGAC TGGCCCGATC AACCGTTCAA ACTGTACGCG
CCAGGGGTTG ACTCCGGAAC ACACGACTAC TTCACTGCGG CGATCGTTGG CAAGGAAGAT
GCCAGTCGCA ATGATTATAT CGGCAGCGAA GACGATTATG TGCTCATGCA GCGCGTCATC
GAAGATGCGC AGGGGATTGC GTATGTCGGA TACGCCTACT ACCAGGAGTA TGCCGACAAA
GTCGGCGTAG TAGCCGTCGA TGCAGGGCAG GGGTGCGTAT CGCCCTCACT CACCACGATC
ACGGAAGGCA CATATACACC ACTATCACGT CCATTGTTCA TTTATGTGCG CGCTGATCGT
CTCGACCGAC CGGCGATGCT GGCATTCGTT GAATTCTATA TCAACCGCGC AGAACAACTG
GTTCAAGATG CGCGCTACAT CCCGTTGCCG CAGCGCGCCT ATGAACTGGT GCAGCAGCGC
GTTGACAGGC GGGTGACAGG TTCAATCTTC GACAAGCCGG TGCCGGTTGG CGTTTCAATC
GATGAGTTGC TGATGCTGGA GGGGCAGTGA
 
Protein sequence
MFSRLYRGTV LSIALLLSAC TGAAAPSPTP TPSPAPTAAP SATPQIVSIQ PTATPFPESM 
TRLDLSGEII IDGSSTVYPI TELAMQQFAA VAPRVTIQLG VSGTGGGFKK FCAGITDISN
ASRPIKPDES DTCRANGIAF VEIPVAFDGI SVIINRDNTW AQCMTVDDLK RMWAPESEGS
VTTWRQIRSD WPDQPFKLYA PGVDSGTHDY FTAAIVGKED ASRNDYIGSE DDYVLMQRVI
EDAQGIAYVG YAYYQEYADK VGVVAVDAGQ GCVSPSLTTI TEGTYTPLSR PLFIYVRADR
LDRPAMLAFV EFYINRAEQL VQDARYIPLP QRAYELVQQR VDRRVTGSIF DKPVPVGVSI
DELLMLEGQ