Gene Rcas_2217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2217 
Symbol 
ID5539698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2863973 
End bp2866003 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content62% 
IMG OID640894350 
Producthypothetical protein 
Protein accessionYP_001432318 
Protein GI156742189 
COG category 
COG ID 
TIGRFAM ID[TIGR01319] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000677217 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGTCC CAATCGCATT GCATGCCTTT CTGGTCGCCG ATGTCGGCAG CACAATGACG 
CATGTCTGGC TGGTTGACGC CGTCGATGGC GAAACTCGTC TTATCGGCTA TGCCGAAGCG
CCGGGCAGCG TTCCTTCGAC CGGCGACGCA ACACCCGCCA TTCTCGAAGC CGTACAGCGC
ATCGCAGAAC AAACCGGTCG CCGTCTGATC GACAACAATA CGCTGGTGAT GCCGAAAGAG
GCGGAGGGCG ATGGTGTTGA CGGCATTCTG GTCTGTAGCA GTGCAGCAGG CGTTATGAGC
CTGATCATTG CGGCGGTCGC CGGCGATATT TCGGCGCGGA GCGCGCAACG CGCCGCGCGT
GCAACATATA CTCGCGTCCT TCAAACCATC ACGCTCGATG ATGCAGTTCA CCAGGAACAG
ATCGGCGTCC TCGCAGATTC GGGCATGACC TGGATCGAAC GCCAGGTGCA GGCGTTGCTC
GGAGTGCAGG CTGACGGCGT AGTGATTGTC GGCGGAATCG AAGGCGGCGC GCACGACGCG
CTCATTCGCC TCGCACATAT CGTCGGACTG GCATCACGGA GCGTTCAGAC AGACGCCCAG
GGCAGACAAA CCTACAATGC CGCCCGAAAA CCGATCATTT TCGCAGGAAA CAGTCAGGCG
CGTGCAGGGG TGGCTGCGGC GCTGGAGGAC CACCCCGATC TCATCGTGGT TGACAATATT
CGACCAACCC TGGACATCGA GCGCCTCGAT CCGGTGCGCC GCGAAATCGT GCGCTTCTAC
AACGAGCACA TCCTGACCCG CTTCGCGCGA ACATCGGGCC TTCAGCGCCT CTCTCGCGCG
CCTGTGTGCA CATCGTGCGA CGCCACGGGT GTGATAACCC GCTTCGCGGC GGAGACGGCG
CAGTGTAACG TTCTGACCCT CGATGTCGGC TCATTGAGCA CGACTGCGCA CTTGTGCAGC
GAAGGACGCT ATAGTCCCGT CGTTTTGGGC GGCGTCGGCA TCGGGTATGG GGTTGGGGCG
CTCCTGGCGC AACGCGGAGT CGGCGCCATC CGACGCTGGC TGCCCTTCCC GATCAGTGAG
CGCGACCTGG CACACTGGCT CCTCAACAAG ATGTTGCGTC CGCACATTCC ACCTCTGACC
CGCGAAGAAC TGCTGATCGA GCACGCAGTA GCGCGCGAGG CGCTTTCCCG CGTGATAGAG
ACGCTCCTGG ACGAGCGACC CGATGCACGA TACGACCGCG TCTTCGTTGG TGGCGGAGTG
TTGCGCCATG CACCCCATCC CGGCCTTGCA TTGCTCACCG TTCTGGATGC ACTGCAACCA
ACCTCTCAAG AGAATATCAT GACACTCGAT GTGCACCTTG ATAGCCTGGG GTTAATGAAT
GCCTGCGGCA CACTCGCCTT TTCCGAAGCC GACGCCGCGC TGACGTTGTT CGAGCGTGAC
CTGATGAACA ACACGCCGCT GGCGACGGTC GTCACAACGC TTGGCGAAGG GCGTGCAGGA
GAAACGGCAG TTGAAGCCGA GTTGCGGGTG GAAGGCAAGT CCACCTATAC GATGCGTGTC
GCTCATGGTG AGATCGCATG TTTGAGCCTG CCCCCTGGCC AGTACGGCAC GCTGACGCTG
CGACCAACCG CTGGTGTGCG GATCGGGCGC AACGCACCGG GCGCCGAAGT CGCCTCAGAA
CTGGCGGCCA TTCGCGGCAG CGCCCTTGGT GTGGTCATCG ACGCGCGCGG CAGACCGCTA
CGCCTGCCGG ACGAGCCAGC GGCGCGGCAG CAGGCGCTCT GGTCGTGGTT AGTGGCGCTT
GGCGTCGAGC GCGAACCATT GCCATATCCG GCGCTCGACA CGGTTATCGA AGCGCCGTCG
CCGACTCTGT CTTCCACAGG GAGTGAGCCG CACAGCAGCC GAGCGTCACT CTTGCAATCC
GACGAACGCC CGTCAACGGA GTCAGGCGAC AGCATCGAAC GTGATCTGGC AAAACTGCGT
GAGACAGTCG AGACCCCCCA GAAAAGGCGT GGGCTTTTCC GCCGAAATTG A
 
Protein sequence
MSVPIALHAF LVADVGSTMT HVWLVDAVDG ETRLIGYAEA PGSVPSTGDA TPAILEAVQR 
IAEQTGRRLI DNNTLVMPKE AEGDGVDGIL VCSSAAGVMS LIIAAVAGDI SARSAQRAAR
ATYTRVLQTI TLDDAVHQEQ IGVLADSGMT WIERQVQALL GVQADGVVIV GGIEGGAHDA
LIRLAHIVGL ASRSVQTDAQ GRQTYNAARK PIIFAGNSQA RAGVAAALED HPDLIVVDNI
RPTLDIERLD PVRREIVRFY NEHILTRFAR TSGLQRLSRA PVCTSCDATG VITRFAAETA
QCNVLTLDVG SLSTTAHLCS EGRYSPVVLG GVGIGYGVGA LLAQRGVGAI RRWLPFPISE
RDLAHWLLNK MLRPHIPPLT REELLIEHAV AREALSRVIE TLLDERPDAR YDRVFVGGGV
LRHAPHPGLA LLTVLDALQP TSQENIMTLD VHLDSLGLMN ACGTLAFSEA DAALTLFERD
LMNNTPLATV VTTLGEGRAG ETAVEAELRV EGKSTYTMRV AHGEIACLSL PPGQYGTLTL
RPTAGVRIGR NAPGAEVASE LAAIRGSALG VVIDARGRPL RLPDEPAARQ QALWSWLVAL
GVEREPLPYP ALDTVIEAPS PTLSSTGSEP HSSRASLLQS DERPSTESGD SIERDLAKLR
ETVETPQKRR GLFRRN