Gene Rcas_0650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0650 
Symbol 
ID5538113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp856636 
End bp857865 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content60% 
IMG OID640892807 
ProductABC transporter related 
Protein accessionYP_001430793 
Protein GI156740664 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAG CAATCGAATT CGAGGACGTT TCCAAACGGT TTCTATTGCA GCGTGATCGC 
CCTGTGACCA TTCAGGAGCG TCTGGCAGGA TGGTTACACC CGGCTAAGGC TGCCGATGAA
TTTTGGGCGT TGCGCAACGT GAGCTTCAGC ATTGCTCGTG GTGAAAGTTT TGGACTGATC
GGGCACAATG GCGCCGGAAA GAGCACCGCG CTCAAGTTGA TGACCCGCAT CCTCGAACCG
ACAGCGGGTC GGGTGCGTTT GCGGGGACGT GTGGCAGCGC TCCTTGAGCT TGGCAGCGGG
TTTCATCCAG AGTTGAGCGG TCGCGATAAT GTGTTTCTCT ACGGTTCACT TATGGGACTC
AGCCGACGCG ACATGGCAGC GCGTCTGGAG GAGATCGTGG CATTCGCCGA CATGGCCGAC
TTCCTCGATC TCCAGGTCAA GTACTATTCC TCTGGCATGT ACACGCGCCT GGCGTTTGCG
GTCGCAACGG CAGTAGACCC CGATATTCTG ATTACCGACG AAGTGCTGGC GGTTGGCGAC
GAGGCGTTTC AGCGCAAGTG CATGGATCGC ATTTTGACAT TCCGCCACGC GGGAAAGACG
ATTGTGTTTG TGTCGCATGC GCTCGATACG GTGCGGACCC TGTGCGATCA TGCCGTCTGG
CTGGATCGCG GCGTGGTTCG TGCGCTCGGA TCAACCGGTG AGGTGATTGA CGCCTACCTC
GCCGAGGTGA ACCGTCGTGA ACGTGAGGCG TTGGCGCGGC ACGAGATGTT GGCTTTGGAG
TCCAACCGAC GGTTCGGCAC GCGCGAAGTC GAGATTACCG GTGTGGAACT GCTCGATACC
GATGGGGTTG CGCGGGCGGT GGCGCACACC GGCGCACCAT TGACCATTCG GATTCGCTAT
CATGCCTGGC AGAACGTGCC GCGTCCCGTT TTCGGTCTGG CAATTCATCA CGAAAGCGGC
ATCCTTCTCG CCGGACCAAA TACATTGTTT GCGGGTCTCG ACATTCCTGC CGTACAGGGC
GGAGGGACGG TTGAGATGCG TATTCCGGCG TTGCCGCTGC TGGCTGGGCG GTACCTCCTC
AGCGCAGCCG TGTACGACGA AACCATGTTG CACGCCTACG ATCATCACGA CCGCCTCTAT
CGCTTTACCG TCCAGAATGA AGGGGGAAGG GAACGCTTCG GCGCAGTGAC GCTGGGAGGC
GTTTGGTCCT GGCGCGCGGC GGGCGCGTGA
 
Protein sequence
MSAAIEFEDV SKRFLLQRDR PVTIQERLAG WLHPAKAADE FWALRNVSFS IARGESFGLI 
GHNGAGKSTA LKLMTRILEP TAGRVRLRGR VAALLELGSG FHPELSGRDN VFLYGSLMGL
SRRDMAARLE EIVAFADMAD FLDLQVKYYS SGMYTRLAFA VATAVDPDIL ITDEVLAVGD
EAFQRKCMDR ILTFRHAGKT IVFVSHALDT VRTLCDHAVW LDRGVVRALG STGEVIDAYL
AEVNRREREA LARHEMLALE SNRRFGTREV EITGVELLDT DGVARAVAHT GAPLTIRIRY
HAWQNVPRPV FGLAIHHESG ILLAGPNTLF AGLDIPAVQG GGTVEMRIPA LPLLAGRYLL
SAAVYDETML HAYDHHDRLY RFTVQNEGGR ERFGAVTLGG VWSWRAAGA