Gene Information |
Locus tag | Caul_4009 |
Symbol | |
ID | 5901471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4339907 |
End bp | 4341127 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564530 |
Product | hypothetical protein |
Protein accession | YP_001685632 |
Protein GI | 167647969 |
COG category | |
COG ID | |
TIGRFAM ID | |
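The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive. A minimal Python sketch of that arithmetic follows. The function names are hypothetical, and both the inclusive-coordinate convention and the assumption that the final codon is a stop codon are inferred from the values in this record, not stated by the source.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4339907, 4341127)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both printed values match the Gene Length and Protein Length fields of this record.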
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.281652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCTT GGTCCAGGCT GCTCGGAGGG CTGCTGGGAT TGGCGGCGCT CGCCGCCGCG CACGGCGTCG CTCTGGCCCA GGCGCCGGAC GCCAGCGTCC AGCATCCAGA CTATGCCGAT CCAGGGCTGT GGCTGTGCCG GCCCGACCTG GCGGACAACC GCTGCAAGGT CGACCTCGAC GCCACGGTGA TCGCGCCCAG CGGCAAGATG ACGGTCGAGC GCTACGTCCC GGCCAAGGAC CCGAAGATCG ACTGCTTCTT CGTCTATCCC ACGGTCTCCA ACGATCCGGG TTGGATCTCG GACTTCTCGC CCGACGCGGC CGAGTGGGAC GACATCAAGG TGCAGTTCGC CCGCTTCGGG TCGGTCTGCC GGCAGTTCGC GCCGCTGTAT CGCCAGGGGA CGCTGCGGCG GCTTCGGGCG CCGAGCGGCG GGCCGGCCCC GGTGGGGGCG CAACCGGCGC CGGGCCTTGG CGGCTTCTCG GACGTGGTCG ACGCCTGGGC CTGGTACATG GCCAACGAGA ACAAGGGCCG GGGCGTCGTC CTGATCGGCC ACAGCCAGGG CGGCCTCATG ATCACCCGGC TGATCGCCCA GGAGATCGAC GGCAAGCCCG TCCAGAAGCA GCTGATTTCC GCCCTAATCC TGGGGGCGCC GGTCATGGTC CCTCCCGGCA AGGACGTCGG CGGTTCGTTC ACGTCGGTCC CCCTGTGCCG CACCGACACC CAGGTCGGCT GCGTGATCAC CTACGTGACT TTCCGCGACC GCCTGCCGCC GCCTTCGACC TCGCGCTTCG GCAAGGCCCG CGACGGGCTG CGCGCCGCCT GCGTCAATCC GGCCAGCCTG GCCGGCGGCT CGGGCCAGCC GGAGTCCTAT TTCATCACCA ACGGTTTCCT GAACGGCTCG GGCGGCGACC TCCAGCCTGA ATGGGTGCGG CCGATGCGGC CGATCGGAAC CTTCTTCGTC AAGGCGCCGG GGCTGGTCTC GACCGAATGC GTCGAGAGCG GTGATTTCAA CTACCTGGCC CTGCACGTGA ACGGCGATCC AAGGGATCCG CGCACCGACG AACTGGGCGG CCAGATCATC CGCCACACCG GCGTCGACCT GTCGTGGGGG CTGCATCTTC TCGATGTCGA TCACTCGATC GGCACGCTGA TCCGCATCGT TCGCAGGCAA GGGGAAACCT ACGAGACGGG CGAGCGCAGA GCGGGTTCGC ATCAATACTG A
|
Protein sequence | MKAWSRLLGG LLGLAALAAA HGVALAQAPD ASVQHPDYAD PGLWLCRPDL ADNRCKVDLD ATVIAPSGKM TVERYVPAKD PKIDCFFVYP TVSNDPGWIS DFSPDAAEWD DIKVQFARFG SVCRQFAPLY RQGTLRRLRA PSGGPAPVGA QPAPGLGGFS DVVDAWAWYM ANENKGRGVV LIGHSQGGLM ITRLIAQEID GKPVQKQLIS ALILGAPVMV PPGKDVGGSF TSVPLCRTDT QVGCVITYVT FRDRLPPPST SRFGKARDGL RAACVNPASL AGGSGQPESY FITNGFLNGS GGDLQPEWVR PMRPIGTFFV KAPGLVSTEC VESGDFNYLA LHVNGDPRDP RTDELGGQII RHTGVDLSWG LHLLDVDHSI GTLIRIVRRQ GETYETGERR AGSHQY
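The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a hedged illustration, not the database's own method: the names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is the standard one, which is assumed to be sufficient here because translation table 11 uses the same codon-to-amino-acid assignments and differs in its permitted start codons (this gene starts with ATG). The sequence as displayed already reads in coding orientation, so no reverse complement is applied despite the minus strand.

```python
# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGAAGGCTT GGTCCAGGCT GCTCGGAGGG"))  # MKAWSRLLGG
# Last four codons of the gene sequence, including the TGA stop.
print(translate("CATCAATACTGA"))                      # HQY
```

Passing the full Gene sequence field to `translate` and comparing against the Protein sequence field (after `clean`) checks the two fields against each other, and `gc_percent` on the same input can be compared with the GC content field.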