Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3795 |
Symbol | |
ID | 5901257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4111503 |
End bp | 4112744 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641564317 |
Product | integrase family protein |
Protein accession | YP_001685419 |
Protein GI | 167647756 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000111159 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000215958 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCCAAGCC TGACCGACAC CGCCATCCGA CATGCGCTGA AGCGCGTCGA GATGAGCCAG AAGCAAGAAA ACCTCGCCGA CGGCGAAGGG CGCGGCACCG GCCGTCTCGT CCTCGTCCTG AAGCCCATGC CCAAGCGCGT GACAGCCGAC TGGATGGCCC AGCAGTGGCG CGATGGGAAG CGGACCAAGA AGAAGCTCGG CGCCTACCCG TCAATGACGC TCGGCCAGGC CCGCGAGATT TTCAAACGCG ACTATGCAGA TGTGATCCAG AAGGGTCGAA GCATCAAGAT CGCCACCGAT ACGCGCCCTG GCACCGTCGC CGATCTTTTC GAGGGCTATG TCGCGTCGCT CAAGGCCGCG AACAAACCCT CTTGGAAAGA AACCGAAAAG GGTCTGAACA AGATTGCCGA TACGCTCGGG CGCAACCGTC TCGCCCGCGA GATCGAGTCC GAGGAAATCA TCGAACTGAT CCGCCCGATC TATGATCGCG GCGCCAAGTC GATGGCCGAT CACGTCCGAT CGTATATCCA CGCCGCGTTT AGCTGGGGCA TGAAGTCCGA TAACGATTAT CGGCAGCAAT CCGCTCGTCG CTTCCGCATC CCGTTCAATC CAGCAGCAGG CATTCCAACC GAGCCCAAGG TCCAGGGCAC GCGCTGGCTC GATGAGGACG AGTGGGTCCA GCTCTATCGC TGGCTCGAGT GCCCCGATAC GCCTGTCCAT CCTCCGTATA CCCGTGCGGT CCGAATCCTC ATGTTGACCG GTCAGCGGGT CGAGGAGATC GCGAACCTTC ACATCGATCA ATGGGACGCT AAGGAGAAGA TTATCGACTG GTCCAAGACT AAGAACGACC AACCTCACGC CGTGCCGGTG CCCGCTCTCG CCGCAGAGCT ACTCTCCTCC ATCAACGTCA ACGAGTACGG CTGGTTCTTC CCTTCGGCCA TGGATCCCTC CAAGCCCGTC AGCCATGGCA CGCTCTATTC GTTCATGTGG CGCCAGCGGG ATCGTGGTGT GATTCCCTAT GTGACGAACC GCGATCTGCG TCGAACCTTC AAGACACTCG CCGGCAAGGC AGGCGTGCCG AAGGAGATCC GCGACCGCCT GCAGAACCAC GCCTTGCAGG ACGTCAGCTC CAAGCACTAC GACCGCTGGA ACTACATGGT GGAGAAACGT GCCGGCATGG CGAAATGGGA CAAATTCGTC CGCGCCATGC TTGCGAAGAA GCGCATGAAG GCGGCCGCAT GA
|
Protein sequence | MPSLTDTAIR HALKRVEMSQ KQENLADGEG RGTGRLVLVL KPMPKRVTAD WMAQQWRDGK RTKKKLGAYP SMTLGQAREI FKRDYADVIQ KGRSIKIATD TRPGTVADLF EGYVASLKAA NKPSWKETEK GLNKIADTLG RNRLAREIES EEIIELIRPI YDRGAKSMAD HVRSYIHAAF SWGMKSDNDY RQQSARRFRI PFNPAAGIPT EPKVQGTRWL DEDEWVQLYR WLECPDTPVH PPYTRAVRIL MLTGQRVEEI ANLHIDQWDA KEKIIDWSKT KNDQPHAVPV PALAAELLSS INVNEYGWFF PSAMDPSKPV SHGTLYSFMW RQRDRGVIPY VTNRDLRRTF KTLAGKAGVP KEIRDRLQNH ALQDVSSKHY DRWNYMVEKR AGMAKWDKFV RAMLAKKRMK AAA
|
| |