Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2823 |
Symbol | |
ID | 5540310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3659987 |
End bp | 3661552 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894950 |
Product | tail sheath protein |
Protein accession | YP_001432912 |
Protein GI | 156742783 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.43909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGT ATCTCTCGCC TGGCGTCTAC ATCGAAGAAG TCAGCAGCGG TCCGCGTCCG ATTGAGGGGG TTGGCACGGC AATGGCGGCG TTCGTCGGGT TCGCCGCTGC CGGTCCCGTT AATCAACCTG TGCTGGTGAC CAGTTGGACG CAGTATGTCG AGAAGTTTGG TCGTCTCGAT GAAAGCGGGC GGCGCAACCC ACACATGGAT GGCGCCTATC TGTCGCATGC CGTCTACGGC TACTTTCTCA ACGGCGGCGG TCGGTGCTAT GTGACGCGCA TCCCGCAGCA GGCGGACGGC AAAGCGCCTC CTCCGCCGCG CCTCGAACTC CCGACGCGGG CCTCGAAGGC GCTGACCTCG CTGATTGTCA CACCCAAGAG CGAAACTGCC AGCGACATTC AAGTGGAGAT CGGTCCGCCG GTTGGCGAAA ATCCGCCTCC CGAAGCGTTT ACGGTCAAAA TCAGCATGGG GGAAGTGAAG GAAGTCTACG AGAATGTGTC GTTCAACAAA CGACCAAAAG ATGGAACCTC TTACGTGGTC GAGAAGATCA ACAGTTCCAG CACGCTGGTG CAGGTCGCTG AAGGACCGGC GACCGGCTCG CTGGCGGACC GTGTGCCGGA GTTTGGCATG TCGGTCATCA AGCCGCTGGC GCCGATCGTT CCGGCGCGCG TGGATGCGAC GACATTCGTC GGTAGCGCCG CCGAGCGCAG CGGTGTCGAG GGATTGGAGA TCGCCGAGGA TGTGACCATG ATCTGCGCGC CAGACCTGAT GGCAGCCTAT CAGTCGGGCG CAATCACGAA AGAAGGAGTC AAAGCTGTCC AACTGGCGAT GATTGCCCAC GCCGAACGCA TGCAGGATCG CATGGTCATT CTCGATCCGC TTCCTGGTCT GACGCCGCAG CAGGTCAAGC AGTGGCGCGA GCGCGACACG AACTACGACT CGAAGTTTGC CGTGCTCTAC TACCCCTGGC TCAAAATCAT GGGACCGGAC GGCAAGACGG AGATGGAGAT TCCGCCGTGC GGACACATTG CGGGCATCTG GGCGCGCAAC GACAATACGC GCGGCGTCCA CAAAGCGCCG GCGAACGAGG TTGTTCAGGG TGCGCTTGGA CCGGCGATTG CGATCACGAA AGGCGAGCAG GATGTGCTCA ACCCGATTGG GGTCAACTGC ATCCGGTCGT TCACCGGTAT GGGGTTGCGG GTCTGGGGTG CACGCACCCT CTCCAGCGAT GCCGCCTGGC GCTACGTCAA TGTGCGGCGT CTTTTCAACT ACGTCGAGAA GTCAATCGAA CGCGGGACGC AGTGGGTTGT CTTCGAGCCG AACGACCCCA ACCTGTGGGC GCGCGTCAAG CGTGACGTGG AAGCGTTCCT GACCGTCTGC TGGCGTGATG GCATGCTGTT TGGTCTGACA CCGCGCGAGG CGTTCTATGT CAAGTGTGAC GAAGAACTGA ACCCGCCCGA AGTGCGCGAT CAGGGCAAAC TGATCATCGA AGTCGGGCTG GCGCCAGTCA AACCCGCCGA GTTCGTCATC TTCCGCTTCA GCCAGTTCGC TGGCGGCGGG GCATAA
|
Protein sequence | MPEYLSPGVY IEEVSSGPRP IEGVGTAMAA FVGFAAAGPV NQPVLVTSWT QYVEKFGRLD ESGRRNPHMD GAYLSHAVYG YFLNGGGRCY VTRIPQQADG KAPPPPRLEL PTRASKALTS LIVTPKSETA SDIQVEIGPP VGENPPPEAF TVKISMGEVK EVYENVSFNK RPKDGTSYVV EKINSSSTLV QVAEGPATGS LADRVPEFGM SVIKPLAPIV PARVDATTFV GSAAERSGVE GLEIAEDVTM ICAPDLMAAY QSGAITKEGV KAVQLAMIAH AERMQDRMVI LDPLPGLTPQ QVKQWRERDT NYDSKFAVLY YPWLKIMGPD GKTEMEIPPC GHIAGIWARN DNTRGVHKAP ANEVVQGALG PAIAITKGEQ DVLNPIGVNC IRSFTGMGLR VWGARTLSSD AAWRYVNVRR LFNYVEKSIE RGTQWVVFEP NDPNLWARVK RDVEAFLTVC WRDGMLFGLT PREAFYVKCD EELNPPEVRD QGKLIIEVGL APVKPAEFVI FRFSQFAGGG A
|
| |