Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3807 |
Symbol | |
ID | 5901269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4127866 |
End bp | 4129941 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641564329 |
Product | hypothetical protein |
Protein accession | YP_001685431 |
Protein GI | 167647768 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3593] Predicted ATP-dependent endonuclease of the OLD family |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0465768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.185185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAATCA AGCACGTGGA GATCGAAAAT TTTCGGCTCC TGCGAAAGGT TGCGATAGGA CTCGAAGAGC GGACCACGCT CATCGTTGGT CGCAACAACA GCGGCAAGAC ATCGATCGCC GAGCTCTTCC GCCGGCTCCT GTCCGAACGG ACCCTAGCGT TCAAACTGGA GGATTTTTCC CTCGGTTGTC ATGAGTGCTT TTGGACGGCA TTCGAAGCTT TCCGGGCAGG CGCAGATTCC TCCGATGTCC GCAGCCACCT CCCGTCGATT ACGATCGCGC TCGACATCGC CTACGATGTT GATGCGCCGG ACCTGGGGCC GCTGAGCGAC TGCATCATTG ATCTCAATCC CGACTGTACG GAGGCGAGGC TGGTGCTCAC CTTTGGCCCG CGAACGACGG CAGCCGAGAC TTTGTTCGCC GGCATCGTTG CGCCTGGGGA AGACGTAGCT GCCGACCGTA TCGCGCTGTT TCGGGCTCTC GGCAGCCGGT TGCCGGCGGC TTATGCGTCG TCGCTTGAGG CAGTCGATCC GAATGATCCG ACCAATCGAA AGACCCTGGA GCCGAAGACG CTCACCGCCC TGATCCACGG GGGGTTCATC AACGCCCAGC GTGGACTGGA TGACGACACC CATCGCGAAC GCGATGTGCT CGGCAAGGTC GTTGAAGTGC TCTTTCAGTC GGCGTTGACC GATCCGCTTG ACCCGGAAAA GCGAACGACG GCCGAGCAAT TGAAGGCGGC CGTGGAACAG ATTCAGGGAG ATTTACATGT CGGCTTCAAT GCGAAGCTCA CTTCCTTGCT GCCAACGTTC GATCTGTTCG GCTATCCAGG CTTGGCTGAC CCCGGGCTCG TTACCGAAAC GAGCTTCGAT GTCGACAAGC TCCTGAACGA CCATACCAAG GTCCGCTATG TCGGCGTAAA CGGCGTCACT TTGCCCGAGA CCTACAATGG GCTCGGCGTA CGCAACCTAG TCTACATGCT GCTTCAGCTG CTCCGTTTCT TCCGTGAGTA TCAAGCAAGT CCAGCGGCGG CGGGCGTGCA TTTGGTCTTT ATCGAGGAGC CTGAGGCCCA TTTGCATCCG CAGATGCAAG AGGTTTTCAT TCGGCAGCTG GACCAGATCA CGAGCGCCTT CGTCGCGCAG CTTAACGAGA ACCGTCCGTG GCCCGTGCAG TTCGTCGTGA CCACGCACTC TCCTCACATG GCGAATGAAG CGCGATTTGA GTCCATGCGG TATTTCCTGT CTGTGGCGGA CGGCAACGGT TTACGTCGGT CGGTCATCAA GGATCTGCGG CAGGGAATGG GCAATGCTCC GGCCCCTGAC CGTGAATTCC TGCACCAATA TCTAACCCTG ACTCGATGTG ACCTATTCTT CGCTGACAAA GCCGTGCTCA TCGAAGGCAC TTCGGAGCGG CTACTGCTCC CAGCAATGAT CCGGAAGACT GATGCTGCAG CGGCTGGGGA AGCGCAGCTC GGGAGCCAAT ATCTGACAGT AATGGAGGTC GGTGGCGCCT ACGCGCATCG GTTTTTCGAT CTTCTGGCCT TCCTTGAGCT GCGCGCGCTG ATCATCACCG ACATCGACAC TGTGAAGCCC AATGACAAAG GGAAGCACGT GGCGGCCCTC GTCGCCGACG GTCAGTTCAC CAGCAACGGC TGCATCAAGG CCTGGTTCGA AATTGCAGTG TCGCCGGCTG CATTGTTGGC GAAGACGTCA CCGGACAAGA CGGTTGGCAG TCGTCGGCTT GCCTATCAAA TTCCGGAGAC GGACGGCGGG CCGTCCGCGC GTAGCTTTGA GGATGCGTTC ATTCTTGCAA ATCCCGAACG CTTCGCACTC GGGGATGGTG ATGCGGCCGC ACTCGCCTAT GAGCACGCGG CCGAGCAGAA AAAGTCGACA TTCGCATTGG AACACGCGAT CGAACACACG GACTGGAATG TGCCCCGCTA CATTTCGGAA GGCCTGCGGT GGCTGGCCCA GGGCAACCCT GCTCCGTTGG AGCCTCCGCT GGCGGTCGCG GTCGAGATTG TTGCGGGCGT GGCCGGTGTC GCGGTGAACG TCGCCGAGGT TGCCGGAAAT GGCTGA
|
Protein sequence | MRIKHVEIEN FRLLRKVAIG LEERTTLIVG RNNSGKTSIA ELFRRLLSER TLAFKLEDFS LGCHECFWTA FEAFRAGADS SDVRSHLPSI TIALDIAYDV DAPDLGPLSD CIIDLNPDCT EARLVLTFGP RTTAAETLFA GIVAPGEDVA ADRIALFRAL GSRLPAAYAS SLEAVDPNDP TNRKTLEPKT LTALIHGGFI NAQRGLDDDT HRERDVLGKV VEVLFQSALT DPLDPEKRTT AEQLKAAVEQ IQGDLHVGFN AKLTSLLPTF DLFGYPGLAD PGLVTETSFD VDKLLNDHTK VRYVGVNGVT LPETYNGLGV RNLVYMLLQL LRFFREYQAS PAAAGVHLVF IEEPEAHLHP QMQEVFIRQL DQITSAFVAQ LNENRPWPVQ FVVTTHSPHM ANEARFESMR YFLSVADGNG LRRSVIKDLR QGMGNAPAPD REFLHQYLTL TRCDLFFADK AVLIEGTSER LLLPAMIRKT DAAAAGEAQL GSQYLTVMEV GGAYAHRFFD LLAFLELRAL IITDIDTVKP NDKGKHVAAL VADGQFTSNG CIKAWFEIAV SPAALLAKTS PDKTVGSRRL AYQIPETDGG PSARSFEDAF ILANPERFAL GDGDAAALAY EHAAEQKKST FALEHAIEHT DWNVPRYISE GLRWLAQGNP APLEPPLAVA VEIVAGVAGV AVNVAEVAGN G
|
| |