Gene Information |
Locus tag | Caul_4431 |
Symbol | |
ID | 5901892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4799978 |
End bp | 4801207 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641564949 |
Product | hypothetical protein |
Protein accession | YP_001686049 |
Protein GI | 167648386 |
COG category | [R] General function prediction only |
COG ID | [COG2081] Predicted flavoproteins |
TIGRFAM ID | [TIGR00275] flavoprotein, HI0933 family |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.803445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.770543 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGAAC TCCAGACTCC CGACGTCGCC GTGATCGGCG GTGGGCCGGC CGGGCTGATG GCGGCCGAGA TGCTGAGCGC GGCGGGGCTG TCGGTGGCGG TGTTCGAGCG CATGCCGACC CTGGGGCGCA AGTTCCTGAT GGCCGGGCGC GGCGGGCTGA ACCTGACCCA TTCGGAAGAC CTTGAGCGGT TCGTGGCGCG CTACGGCGGC GCGAGCGAGC GGCTGCGGCC GATGCTGCAG GCCTTCACGC CAGCCGATCT CGTCGCCTGG GCCGAAGGGT TGGAGCAGGA AACCTTCGTC GGCACCAGCG GTCGGGTGTT TCCCAAGGCG CTGAAGGCCT CGCCGCTGCT GCGGGCCTGG ATCGCGCGGC TGGAGGGGCG TGGCGTGGCG CTCAACACCC GCTCGACCTG GACGGGCTGG AACGCGGCCG GCGACCTGGT CTTCGACACG GCGGACGGCG TTCGGACCGT GCGGCCGCGC GCCACCATCC TGGCCGTCGG CGGGGCCAGT TGGGCCAAGC TGGGGTCGGA CGGCGCCTGG GCGCCGCTGC TGGCCGCGCG CGGGGCGTCG CTCGCGCCGT TCAGGCCGGC CAATGTCGGC TTCGCAGTCA CTTGGACGAA GGTGTTCCGC GAACGCTTCG CCGGCGCGCC GCTGAAGAAT ATCGGCCTGA GCTTCGAGGG TCAGGCCTCG CGGGGCGACG CCCTGGTGGC GGCCTACGGC CTGGAGGGCG GGGCGGTGTA CGCCCTGTCG GCGGCTCTGC GCGACGCGAT CCTGGCGCGA GGCTCGGCGA CCCTGGACAT TGACCTGCGT CCCGACGTCC CCCTGGCCCA ACTGACCGCG CGCCTGTCCA GGCCGCGCGG CGGGCAGTCG CTGTCGAGCT GGCTGCGCAA GGCCGCCCAC CTGTCGCCGG TCGAGATCGG CCTGCTGCGT GAAGCCCACG GCATGGCCCT GCCGGTCGCG CCCGACGCCC TGGCGGCGGC GATCAAGGCC GCGCCGATCG TGCTGACCGG AACGCAGGGG CTGGAGCGGG CCATCTCCTC GGCCGGCGGC CTAAGCTTCG AGACCCTCGA CGGCCTGGCG TTGAAAGGCG CGCGAGGGGT GTTCGCGGCG GGCGAGATGC TGGACTGGGA GGCCCCGACT GGCGGCTACC TGCTGCAGGC CTGTTTCGCG ACCGGGGTGG CGGCGGCGCG CGCGGTGGTG GAGCATCTTC AGGCCTGCGG TCGAGCGTGA
|
Protein sequence | MTELQTPDVA VIGGGPAGLM AAEMLSAAGL SVAVFERMPT LGRKFLMAGR GGLNLTHSED LERFVARYGG ASERLRPMLQ AFTPADLVAW AEGLEQETFV GTSGRVFPKA LKASPLLRAW IARLEGRGVA LNTRSTWTGW NAAGDLVFDT ADGVRTVRPR ATILAVGGAS WAKLGSDGAW APLLAARGAS LAPFRPANVG FAVTWTKVFR ERFAGAPLKN IGLSFEGQAS RGDALVAAYG LEGGAVYALS AALRDAILAR GSATLDIDLR PDVPLAQLTA RLSRPRGGQS LSSWLRKAAH LSPVEIGLLR EAHGMALPVA PDALAAAIKA APIVLTGTQG LERAISSAGG LSFETLDGLA LKGARGVFAA GEMLDWEAPT GGYLLQACFA TGVAAARAVV EHLQACGRA
|
| |
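The length fields in the record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon; both assumptions agree with the values listed above but are not stated by the record itself. The function names are illustrative, not part of any database API. The GC helper is shown on the first 20 bases of the gene sequence only; applying it to the full sequence should be compared with the listed 73% rather than taken for granted.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


length = gene_length(4799978, 4801207)
print(length)                  # 1230, matching Gene Length
print(protein_length(length))  # 409, matching Protein Length
print(gc_percent("GTGACCGAAC TCCAGACTCC"))  # 60.0 for this 20-base prefix
```

The strand does not affect these checks: a gene on the minus strand still spans the same coordinates, and the listed gene sequence is already given in coding orientation (it begins with the GTG start codon and ends with TGA).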