Gene Rcas_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3964 
Symbol 
ID5541470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5169785 
End bp5170879 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content64% 
IMG OID640896072 
ProductLamG domain-containing protein 
Protein accessionYP_001434015 
Protein GI156743886 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.278047 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCATG GAGTTCGTTT GCTGGCGCTG CTCGTCTGTG TGGCGTCGGG ATGGTTGCTG 
GCGTTAATGG CCGGCTCCGC ATCGCGTCCG TTGCGGGCAC AGTCGTCGGG CGGCTACGCG
CTCCGGTTCT ACGGCAACGG CGTAAGTGAT ATCGACCGGG TGAAGGTGCG GATCGATCCG
CAGGTTCCCG CCGATGTCGG CGGCGATTTC ACCATCGAGT TCTGGCTGAA AACGACGGCC
GCCGTCGGCG CATGCTCGCC GGGAAGCTCC GGCGCCGGCT GGATTACCGG ACGGACGATC
ATCGACCGGG ATGTGTATGG CAATGGCGAC TACGGCGATT ACGGCATCTC GCTGGCGGCG
GGGCGGATTT GCTTCGGCGT GGAGCGCGGC GCGACGGGAA CGACGATCTA TGGCAGCACG
AATGTGGCGA ATGGTCAGTG GCGACATATC GCGGTGACGC GCAGCGCGAG CAGCGGGCAG
ATGCGCATCT TCGTCGATGG GCAACTCGAC GCGCAGGGAA CCGGTCCGAC CGGCGACATC
AGTTACCGCG ACGGGCGCGC AACAGCGTAC CCGAACAGCG ACCCCTTCCT GGTCTTCGGC
GCCGAGAAGC ACGATGCAGG ATCAGAGTAC CCCTCATACG CAGGGTTGCT CGACGATATC
CGCATCTCGA ATGGGGTGCG CTACACCGGC GTCTTCACAC GCCCAACGGC GCCGCACGCC
GTGGATGGGC AGACGGTCGC GCTCTACCGG TTCGACGAGG GAAGCGGCAC GACAATTATC
GACTCGGCGC CGGATGGCGG CAGCCCTGGC GAGCGGCGGT TCGGTGGTTC ACCCGCCGGT
CCGGTCTATG TCGCCGATAT ACCGTTTAGC GGAGCGCTTC CATCGGCGAC GCCAACGCGC
ACCGTCACAC CAATCTCTGG TCCATTGCCT TCGGCAACAT CAACGCCGAC CGCAACACCA
ACAATGACCT CGGTTGCGTT TACCGCCACG GTCACCAGTA CACCAACAAG GACTACCAAC
CCAACGATTA CACCGATCGT TGGCATTTCT CCCCTGAAGC CGCGCGCCTT CCTGCCCTTT
GTTCAAAAAC CGTAG
 
Protein sequence
MRHGVRLLAL LVCVASGWLL ALMAGSASRP LRAQSSGGYA LRFYGNGVSD IDRVKVRIDP 
QVPADVGGDF TIEFWLKTTA AVGACSPGSS GAGWITGRTI IDRDVYGNGD YGDYGISLAA
GRICFGVERG ATGTTIYGST NVANGQWRHI AVTRSASSGQ MRIFVDGQLD AQGTGPTGDI
SYRDGRATAY PNSDPFLVFG AEKHDAGSEY PSYAGLLDDI RISNGVRYTG VFTRPTAPHA
VDGQTVALYR FDEGSGTTII DSAPDGGSPG ERRFGGSPAG PVYVADIPFS GALPSATPTR
TVTPISGPLP SATSTPTATP TMTSVAFTAT VTSTPTRTTN PTITPIVGIS PLKPRAFLPF
VQKP