Gene Rcas_2823 details


Gene Information

Locus tag: Rcas_2823
Symbol:
ID: 5540310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3659987
End bp: 3661552
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 61%
IMG OID: 640894950
Product: tail sheath protein
Protein accession: YP_001432912
Protein GI: 156742783
COG category: [R] General function prediction only
COG ID: [COG3497] Phage tail sheath protein FI
TIGRFAM ID:
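
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, neither of which the record states explicitly. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3659987
end_bp = 3661552
gene_length_bp = 1566
protein_length_aa = 521

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the final codon is a stop and is not counted in the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1566 0 521
```

Under those assumptions the span matches the listed gene length, and 522 codons minus one stop codon matches the listed protein length.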


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.43909
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAGT ATCTCTCGCC TGGCGTCTAC ATCGAAGAAG TCAGCAGCGG TCCGCGTCCG 
ATTGAGGGGG TTGGCACGGC AATGGCGGCG TTCGTCGGGT TCGCCGCTGC CGGTCCCGTT
AATCAACCTG TGCTGGTGAC CAGTTGGACG CAGTATGTCG AGAAGTTTGG TCGTCTCGAT
GAAAGCGGGC GGCGCAACCC ACACATGGAT GGCGCCTATC TGTCGCATGC CGTCTACGGC
TACTTTCTCA ACGGCGGCGG TCGGTGCTAT GTGACGCGCA TCCCGCAGCA GGCGGACGGC
AAAGCGCCTC CTCCGCCGCG CCTCGAACTC CCGACGCGGG CCTCGAAGGC GCTGACCTCG
CTGATTGTCA CACCCAAGAG CGAAACTGCC AGCGACATTC AAGTGGAGAT CGGTCCGCCG
GTTGGCGAAA ATCCGCCTCC CGAAGCGTTT ACGGTCAAAA TCAGCATGGG GGAAGTGAAG
GAAGTCTACG AGAATGTGTC GTTCAACAAA CGACCAAAAG ATGGAACCTC TTACGTGGTC
GAGAAGATCA ACAGTTCCAG CACGCTGGTG CAGGTCGCTG AAGGACCGGC GACCGGCTCG
CTGGCGGACC GTGTGCCGGA GTTTGGCATG TCGGTCATCA AGCCGCTGGC GCCGATCGTT
CCGGCGCGCG TGGATGCGAC GACATTCGTC GGTAGCGCCG CCGAGCGCAG CGGTGTCGAG
GGATTGGAGA TCGCCGAGGA TGTGACCATG ATCTGCGCGC CAGACCTGAT GGCAGCCTAT
CAGTCGGGCG CAATCACGAA AGAAGGAGTC AAAGCTGTCC AACTGGCGAT GATTGCCCAC
GCCGAACGCA TGCAGGATCG CATGGTCATT CTCGATCCGC TTCCTGGTCT GACGCCGCAG
CAGGTCAAGC AGTGGCGCGA GCGCGACACG AACTACGACT CGAAGTTTGC CGTGCTCTAC
TACCCCTGGC TCAAAATCAT GGGACCGGAC GGCAAGACGG AGATGGAGAT TCCGCCGTGC
GGACACATTG CGGGCATCTG GGCGCGCAAC GACAATACGC GCGGCGTCCA CAAAGCGCCG
GCGAACGAGG TTGTTCAGGG TGCGCTTGGA CCGGCGATTG CGATCACGAA AGGCGAGCAG
GATGTGCTCA ACCCGATTGG GGTCAACTGC ATCCGGTCGT TCACCGGTAT GGGGTTGCGG
GTCTGGGGTG CACGCACCCT CTCCAGCGAT GCCGCCTGGC GCTACGTCAA TGTGCGGCGT
CTTTTCAACT ACGTCGAGAA GTCAATCGAA CGCGGGACGC AGTGGGTTGT CTTCGAGCCG
AACGACCCCA ACCTGTGGGC GCGCGTCAAG CGTGACGTGG AAGCGTTCCT GACCGTCTGC
TGGCGTGATG GCATGCTGTT TGGTCTGACA CCGCGCGAGG CGTTCTATGT CAAGTGTGAC
GAAGAACTGA ACCCGCCCGA AGTGCGCGAT CAGGGCAAAC TGATCATCGA AGTCGGGCTG
GCGCCAGTCA AACCCGCCGA GTTCGTCATC TTCCGCTTCA GCCAGTTCGC TGGCGGCGGG
GCATAA
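
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before it can be analysed. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed above.
first_line = "ATGCCTGAGT ATCTCTCGCC TGGCGTCTAC ATCGAAGAAG TCAGCAGCGG TCCGCGTCCG"
seq = clean_sequence(first_line)
gc_count = sum(base in "GC" for base in seq)

print(len(seq), gc_count, f"{gc_fraction(seq):.0%}")
```

The first line alone has 36 G or C bases out of 60. The 61% figure in the record refers to the whole gene; pasting the full sequence into `clean_sequence` would be needed to compare against it.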
 
Protein sequence
MPEYLSPGVY IEEVSSGPRP IEGVGTAMAA FVGFAAAGPV NQPVLVTSWT QYVEKFGRLD 
ESGRRNPHMD GAYLSHAVYG YFLNGGGRCY VTRIPQQADG KAPPPPRLEL PTRASKALTS
LIVTPKSETA SDIQVEIGPP VGENPPPEAF TVKISMGEVK EVYENVSFNK RPKDGTSYVV
EKINSSSTLV QVAEGPATGS LADRVPEFGM SVIKPLAPIV PARVDATTFV GSAAERSGVE
GLEIAEDVTM ICAPDLMAAY QSGAITKEGV KAVQLAMIAH AERMQDRMVI LDPLPGLTPQ
QVKQWRERDT NYDSKFAVLY YPWLKIMGPD GKTEMEIPPC GHIAGIWARN DNTRGVHKAP
ANEVVQGALG PAIAITKGEQ DVLNPIGVNC IRSFTGMGLR VWGARTLSSD AAWRYVNVRR
LFNYVEKSIE RGTQWVVFEP NDPNLWARVK RDVEAFLTVC WRDGMLFGLT PREAFYVKCD
EELNPPEVRD QGKLIIEVGL APVKPAEFVI FRFSQFAGGG A
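
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below is an illustration rather than the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence, blocks joined.
print(translate("ATGCCTGAGTATCTCTCGCCTGGCGTCTAC"))  # MPEYLSPGVY
print(translate("GCATAA"))  # A
```

The first ten residues and the final residue agree with the protein sequence listed above. Ambiguous bases such as N are not handled and would raise a `KeyError` in this sketch.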