Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4973 |
Symbol | |
ID | 5902435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5377127 |
End bp | 5378287 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641565494 |
Product | hypothetical protein |
Protein accession | YP_001686591 |
Protein GI | 167648928 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.351891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGCA GCTTCGACAC CCTGAACGAG ATGGATCCCG CGACGCGGGC CTTTGCTGAG GTGCGCGCAC GCGAAGCCGG AATGAGCCTC GCCGAATGGC TGGGGGCTTT AGTCCACGAA CGCGCCGCGC CGCAGGTGCC GCGTCGCTTT TACGGTGGCG GAGGACCAAT CTGGGCTGAA GGCCCGGGGG CTGGTCGTTC GCTGGCGATA TCGATGGGCG CTCGACCAGC CTCCGCCTAC GACATGGACG TCGCGCACTA CATGCCGATC CTGAGATACC CAGAGATCGA GAGGGGCGAT CTCAATCGTA TTTTTCTCGA CCACGAGGTT CATGAGGTGA CGCGGTCGGC CCTTCACATG ATCGCATTGT TTCCGCCACG GATCAGCCAT GCGGCAATCC TCCTTAGTGC GGCGTTTGCA GAAGCAGACG TGCAGATCCG GTTGCCGCCG TACCGCCCGA GCGGCCACCA CAAAGCCTCA GGCCGTTGGT ATCACACGCT CGTCGATACC TGCCTGCATC GCTCGCGGGA GGTTCGGAAT GCCTTGATGC ATGAATGGAC GCCCAGTGGG CGAGCTGAGC TCATGTGGGG CGACGCCTTC GCCACCCTTT GCGACCTCAC GTCGGCGCGC CAAGAACTCC TGGTCATGTT TCACCACGAT CGGAGCCGGG CGTCTGTCGA ACTCGCCAAA CACCTCGACG TTGCGACCCA GCAGTTCGAG TTGTCCGCCA ATCTTTTGGC GGAGGTCGAT ACGAGCGCGG CAGAGCCGAC TGAACAGCAG CGCTTCACTC GGCTGCGTGA GGCTTTGCTA GAGCGGGCTG GCGGCGGCGT GTCCCTGACC GAAGGCGCAA AGCTTCTCAA TGTCTCACGC CAGGCGCTCC ACAAGCGGAT CAAGGCCGGT ACGGCGCTAG GCATGATGGA TGGCGACGAG CTCGTCTTGC CGCGTCTGCA GTGGGTCACA AAGGGGGACG AAGTCCAGTT TCTCCCTGGA CTGACCGATG TGGTGAGGCA GTTCGAGCGC GCGGGCGGAT GGTCGGCGCT GCAGTTCCTC CTCGATCACG ATCCCAATCT CGCAAAGCCG CCGATCCAGG CGCTGCGCGA AGGCTCTCCT GAGAAGGTGG TTGCGGCCGC GCGCGCCTAC CTTGGCCTGG ACGAGGAATG A
|
Protein sequence | MMSSFDTLNE MDPATRAFAE VRAREAGMSL AEWLGALVHE RAAPQVPRRF YGGGGPIWAE GPGAGRSLAI SMGARPASAY DMDVAHYMPI LRYPEIERGD LNRIFLDHEV HEVTRSALHM IALFPPRISH AAILLSAAFA EADVQIRLPP YRPSGHHKAS GRWYHTLVDT CLHRSREVRN ALMHEWTPSG RAELMWGDAF ATLCDLTSAR QELLVMFHHD RSRASVELAK HLDVATQQFE LSANLLAEVD TSAAEPTEQQ RFTRLREALL ERAGGGVSLT EGAKLLNVSR QALHKRIKAG TALGMMDGDE LVLPRLQWVT KGDEVQFLPG LTDVVRQFER AGGWSALQFL LDHDPNLAKP PIQALREGSP EKVVAAARAY LGLDEE
|
| |