Gene Rcas_0331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0331 
Symbol 
ID5537793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp408326 
End bp409756 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content61% 
IMG OID640892494 
Producthypothetical protein 
Protein accessionYP_001430481 
Protein GI156740352 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.638038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTTG TTTGGTGGAT CCTGATCGCG GCGCTGCTCG TGTCATGTGG TGCGACACCG 
ATTGCTGTGG CGCCGACTGT CATGACCCCA TCTCCTGCTC TCCTCTCCCC ACTTCCGACT
GCGACTCCGC CGCCAACGGT CGCCGCCACG CCAACGCCTT TGCAGAGCCC GACTGCCACT
TCGCCGCCAA CGGTCGCCGC CACGCCAACG CCTTTGCAGA GCCCGACTGC CACTTCGCCG
CCAACGGTCG CCGCCACGCC AACGCCTTCA CAGATCCCGA CTGCGAATCC AACGCCGGTT
CGCGCCACTC CGCTTCAATT CGGCGTAGCG GCGCACCTGT TCTACACGTC GCGCCGATTG
CCGCTGCAAC GCGCCAGAGA GGCGGGTTTC GCCTGGATTC GTCAGCAGAT TCATTGGAAA
GACCTGGAAG GTTCGCCAGG ACGCTACGCC TGGGGTGAAC TGCCCGCCGT CGTTGATGCC
GTCAATGATG CTGGACTTCG CCTGCTTATC ACGATTGTGC GCGCGCCGGC GTTCTACTCG
CGTGGAACCT ATGGCATGCC GGACGACCCG ACGCGCCTGG GGGATTTCGT CAATGCGCTG
GTGCAGCAGT TTCCCGGCAA GATTCACGCC ATCGAAATCT GGAACGAACA AAATCTGGCG
CATGAAAACG GCGGGCGCGT CACCATCGAA GATGCTGGAC GGTATGTTGA ACTGCTCTGC
TCCACCTATC CGCGTATCAA AGCGCGCGAT CCGTCGATTG TCGCGCTGGC GGGCGCCCCA
TCCTCGACCG GCATCACTGA TGCGCGAATG GCGGTGGACG ATAGAGCGTA CTATTCAGCA
ATGTACACAT ATCGAGAAGG TATGATTCGG GACTGTATGG ATGCGCAGGC AGTGCATCCT
GGCGGCGCGG CGAACCCGCC GGACACTCTC TGGCCCATTA ATCCGAGCAC GGCGCGCGGT
TGGACTGACC ATCCCACGTT TTATTTCCGC CACGTCGAGA ATGTGCGCCG CGTGATGGTC
GCGCATGGTC TGCAAGACAT TCCGATCTGG ATCACCGAAT TTGGTTGGGC GACGCGCAAC
ACAACGCCGG GGTTCGAGTA TGGAAATCAG GTATCATTCG AACAACAGGC GGAATACATT
GTTGGTGCGA TGCGGTTGAC AGAAGAGCAG TACCCATGGG TCGAGGCGAT GTTTCTGTGG
AACCTGAACT TTGCCATCCT GCTCGCGCGT AGTGGTCAAC CGCTGCACGA ACAGGCGTCG
TTCGCCATTC TCAATGCCGA TGGCAGTCCG CGTCCGGCGT TTATCGCCAT TCAACAGTAT
CTCAGCGAGG CGAGAGGGGA GAGGGGAGAG GCGAGAGGCG AGAGGGGTAA GAGGCAAGGG
ATCATCCGTA GCGGCGTTGC ACCGCACCGC TACGGTGCAC GGCTGGTGTG A
 
Protein sequence
MHVVWWILIA ALLVSCGATP IAVAPTVMTP SPALLSPLPT ATPPPTVAAT PTPLQSPTAT 
SPPTVAATPT PLQSPTATSP PTVAATPTPS QIPTANPTPV RATPLQFGVA AHLFYTSRRL
PLQRAREAGF AWIRQQIHWK DLEGSPGRYA WGELPAVVDA VNDAGLRLLI TIVRAPAFYS
RGTYGMPDDP TRLGDFVNAL VQQFPGKIHA IEIWNEQNLA HENGGRVTIE DAGRYVELLC
STYPRIKARD PSIVALAGAP SSTGITDARM AVDDRAYYSA MYTYREGMIR DCMDAQAVHP
GGAANPPDTL WPINPSTARG WTDHPTFYFR HVENVRRVMV AHGLQDIPIW ITEFGWATRN
TTPGFEYGNQ VSFEQQAEYI VGAMRLTEEQ YPWVEAMFLW NLNFAILLAR SGQPLHEQAS
FAILNADGSP RPAFIAIQQY LSEARGERGE ARGERGKRQG IIRSGVAPHR YGARLV