Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3520 |
Symbol | |
ID | 5900975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3796322 |
End bp | 3797605 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641564026 |
Product | hypothetical protein |
Protein accession | YP_001685145 |
Protein GI | 167647482 |
COG category | [S] Function unknown |
COG ID | [COG5361] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGTC GGCAGCTATT CGTGGGCGCG AGCGCGCTGG GCGTGATGGG GCTTGGCGCC GCCGCCGTGG CCCAGGGCCT GCGGATTTCG ACCGCCGCCC AGGCCTGGCT CTACGCCCTG CCGATGATCG AGATGGCCAC CACCCGGGCG CGCGTGCTCA AGGCCCCCGG CGCGGCGATC AACAGGCTGG CCCATGGGCG GGAGCTGTCG GATCACACCG CCCGACGCGT GACCACGCCC AACAACGACA CCCTCTATTC CATCGCCTTT CTCGACCTGA GCCAGGGGCC GGCCACCCTG ACGGTTCCCG CGACCGGCGC GCGCTACTGG TCCGCGGCGA TCATGGACAT GTTCACCAAC AACAACGCCG TGCTGGGCCT GCGCACGGTG GGCGGCGAGG GCGGGGCCTT CACCCTGGTC GGGCCGGGCC AACCCGCCAA GGGTCCCAAC CCCGTCCGCG TGGCCACGCC CCACGCCTGG CTGCTGATCC GCACCCTGGT CGTCGACGAG GCCGACCTGC CGGCGGCGCG CAAGGTCCAG GACGGCTTCG TGCTGAGCGG CCCCATGGCC GCGCCGCCGC CGGCCTATGC CGCGCGCGAC GCCGAGGCCG GCGACTACTT CGCCGCGGCC CGCGCCCTGC TGGCCGCAGA TCCGCCGCCC GCCACGGACC AGAAGCTGCT GCGCAAGACC GCCGCCTTCC TGGGCGCGGG TCCGTTCGAC GCCGGGGCGG CGCGGACCGG CGCCCAGGAA GCCCAGATGA TCACCCGCTT CGCCAAGGGC CGGCAGACCT TCACCGACGG CTGGGCCTAT CCGCGCGCCA ATCTGGGCGA CTACGGCCAG GACTACACCT ACCGCGCCAT CGTCGCCCTG ATGGGGCTTG GGGCGCTGCC CGTGGCCGAG GCGATGTACA TGAAGGCGGC CGGCGATGAC GGGGCGGGCC TGTTCACCGG CGACGGCCTC TACCGGCTGA GCCTGCCGGC CGACATGCCG CTGGACGGCT TCTGGTCGCT GTCGATGTAC GAGGCGACGG AAGACGGCCA GTTCTTCTTC ACCGACAATC CGCTGAACCG CTACGCGATC GGCGACCGCA CGGCGGGGCT GGAGCGCGAG GCCGACGGCT CGCTGAACCT GTGGATCGGC CGGACGGACC CGGGCGGAGA GCGTTCATCC AACTGGCTGC CCGCGCCCAA GACCGGGCCG TTCGCGATGT ATCTGCGGAC CTATCTGCCG CGCGCGGAAC TGCTGGACGG GCGGTTCCGG TTCAAGCCGG TGGAGAAGGT CTAA
|
Protein sequence | MNRRQLFVGA SALGVMGLGA AAVAQGLRIS TAAQAWLYAL PMIEMATTRA RVLKAPGAAI NRLAHGRELS DHTARRVTTP NNDTLYSIAF LDLSQGPATL TVPATGARYW SAAIMDMFTN NNAVLGLRTV GGEGGAFTLV GPGQPAKGPN PVRVATPHAW LLIRTLVVDE ADLPAARKVQ DGFVLSGPMA APPPAYAARD AEAGDYFAAA RALLAADPPP ATDQKLLRKT AAFLGAGPFD AGAARTGAQE AQMITRFAKG RQTFTDGWAY PRANLGDYGQ DYTYRAIVAL MGLGALPVAE AMYMKAAGDD GAGLFTGDGL YRLSLPADMP LDGFWSLSMY EATEDGQFFF TDNPLNRYAI GDRTAGLERE ADGSLNLWIG RTDPGGERSS NWLPAPKTGP FAMYLRTYLP RAELLDGRFR FKPVEKV
|
| |