Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3894 |
Symbol | |
ID | 5901356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4214649 |
End bp | 4215989 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564415 |
Product | hypothetical protein |
Protein accession | YP_001685517 |
Protein GI | 167647854 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.213149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.698237 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGGG TCGCCGCCAA CAAGTTGCTG ATGGTTCGGA AGCTGGTGGA AACCGCGCCC GACGCCGCCT TGCGTAGCCT CGAGCTCGCC CTGTCCGGCC CGGCGGGCGG GCAGGGCGCG CTGGCGGCCG TGCGCGGGTT GGTCGAGGAC GAAACCGCCG CCCGTTACGT CCGCAACAGC GTGCTGGCCC CGATCGCGCC CCTGTGCGTC AAGCGCGACA CCGAGCAGAC CTCCTTCCCG CCCCGCACCC TGGCCCTGCT GTGGGCCGCC CTGAGGGCCG AAGCCCCCAA GCAGGTCGAA GAGGCCGCCG CCCGCTGCAA TCCGTGGGAC CTGGAGCAGG GTCCGCCGGA CGTCTTCGAC GAACTTTGCA AGATCGCCGC CAAGGGCCTG CGGGCCCAGG CCGCCCCCGG CTTCCTGGCC CTCGACGCGA TCTGCGACAT CGACGAGCTG GCGTCGTGCC TGGAGCTGTC GCACATCGTC CGCGCGGCCC TGCCAAAGCT GTCGGAATGG GTCAGCCGGA TGAGTGACGA GCGTGCCTCC GCCGCTCGCC TGGCCTACAA GGACGCCTGC ACCATCCGCC CGGACGCCGG GCCTCTGCTG TTCGAGATGA TGGCGGCGCA CCTGCCCGAC GACTGGCGGA TCCTGCGCGT GATCTCGGCG GTCATGGACC GGCCCGGCGA CAGGTTCTGG GCCTCTTCGG AGGTCAGCGT GTTCGGCGAA CGGGTGCTGG CCGACATCGA GAAGAACATC GACTACATCC AGGGCTTCGA CGCGGACAAG GGCGAGGCCG AGGGGCGCAA GGCCGCGCTC GCCGCCCAGA AGGTCTCGCA GGAGATCACC GAGTTCGAGC AGTCGGTGAA CCTGGCCAAG GACGGCCCGT GGGGCCGGCG GATTTCCAAG CACAAGCAGG GCGTCGCCCA GGCCGTCGAG AGCCGGATGA ACAAGGCCGA GAACGAGCTG CTGGCGGCCT TGCCGCTGCG GCCGATCTCG ATCCTCGGCG GCAAGAAGGG CAAGGGCGTT CCACAACTGG TCGTCGAACC GGATCCGGCG GCGCACCGCC GCGCGACCGC CGCCCTGGCC TTCATCGCCG ACGTGCGCAG TTGCGCCATG CAGAGCGGCT ACGGCGCCAG CCGCGCCAAG GCCCTGGAAA AGATCAACAG CCGCCTGGAC CAGTATATCG AGGACATCCT GCACGTGGTC CGCACCGGCG ACGGCGGCGA CCCGGTCCTG GCCCGGCTCT ATGTCGACAT GGCCGCCGGC TACATCGCCT TCAGCCGCGA CGAGAAGACC GCCGAGATCG TCCGCCGCCG CGCCGCCGCG GCGATGGCGG CCGCGGCCTA G
|
Protein sequence | MAGVAANKLL MVRKLVETAP DAALRSLELA LSGPAGGQGA LAAVRGLVED ETAARYVRNS VLAPIAPLCV KRDTEQTSFP PRTLALLWAA LRAEAPKQVE EAAARCNPWD LEQGPPDVFD ELCKIAAKGL RAQAAPGFLA LDAICDIDEL ASCLELSHIV RAALPKLSEW VSRMSDERAS AARLAYKDAC TIRPDAGPLL FEMMAAHLPD DWRILRVISA VMDRPGDRFW ASSEVSVFGE RVLADIEKNI DYIQGFDADK GEAEGRKAAL AAQKVSQEIT EFEQSVNLAK DGPWGRRISK HKQGVAQAVE SRMNKAENEL LAALPLRPIS ILGGKKGKGV PQLVVEPDPA AHRRATAALA FIADVRSCAM QSGYGASRAK ALEKINSRLD QYIEDILHVV RTGDGGDPVL ARLYVDMAAG YIAFSRDEKT AEIVRRRAAA AMAAAA
|
| |