Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5331 |
Symbol | |
ID | 5897119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | - |
Start bp | 40137 |
End bp | 42113 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641550623 |
Product | TRAG family protein |
Protein accession | YP_001672109 |
Protein GI | 167621601 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3505] Type IV secretory pathway, VirD4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.390385 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCC GCCCCAAGAT CATCCTGCCG CTAGCAGCGG CCAGCGTGTT GATTGCCCTA TCGATCGCCA CTCAGATCGT CGCTCACGAC TTCCACTACC CGCGTGAGTT CGGCCACGGC CTGCTGGATG TGGGGCAGGC GAGGATCTAC GCGCCGTGGG CGTTCATTGG CTGGTATGGG CGCTTCGCGG CGCGCTACCA GCAGGCCTTC GACATGGCGG CGATGATCGC GCTCGCGGCC GTGTTCGTCC CGTCGATGTT GCTGATCGGG CTGACCAAGA GCACGCGCCG GGCGCCGCGA GAGTTTGGCA AGGACGCCTG GGCGACCGAG GCCCATGTCC GCAAGGCCAA GCTCGTCCAT GGTGACGGCC AGATCAGCGG ACGGGTGCTC GGCCGGTTCA ACGGCAAATA CCTCACCTAT CGCGGTGTGG AGCACGCCAT CATCGTGGGC GCGTCCCGCA GCGGGAAGGG GGCCGGCCAC GTCGTTCCCA CCCTGATCGC CTGGCCGCAA AGCGCCTTCG TCTACGACCG CAAAGGGGAG CTTTGGCACA TCACGGCCGA TCACCGGAAG ACCTTCAGCC ACGTCTTCTA TTTCGCGCCG ACCGACCCCA ACACCGTGCG ATGGAATCCG CTGTTCGAGG TGCGGAAGGG GCCGATGGAG ATCGCCGACA TCCAGAACGT CGTCGGCATC CTGGTGGACC CGCTGGGCCG AAAGGCGGGC GACCTCAATT TCTGGGACCA GAGCGCGACG GACTTCTTCA CCGCGATCAT CCTGCACGTC CTCTACAGCG AGGAGGACAC CAAGAAGAAC CTCGCCCAGG TCCGCCGCCT GCTGATCAAT ATCGATCCGA CCCTTCATGC GATGAAGCAC ACCAAACATC GCCACAGACC GGACCTTCAT GCGCCGGGCG GGCTGGCGCG GGGCGCCGAC GGCAAGCCCA TCGCCGAGGT CCACCCGGAG ATCCTGCTGG GCGCCACGGC GCTGGACAGC ATGGACGAGC GGGTGAAGTC CAATGTGCTG GCCACCTGTC GGGCGTCGCT ATCGCTGTGG GCCGACCCCT ACGTGGAATA CGCCACCAGC TGGTCGGACT TCTCGATCGG CGACCTGGTG TGCTCGGAGA GCCCGGTCAC CTTTTACATC ATCACCCCCC AGGCCCATGC CGACCGCCTC GCCTTTCTGG TGCGGGTGTT CACGCGCCAA ACGATCAACA GCCTGATGGA ACGCGAGCAT TTCGACAGCC GGGGGCGGCG CAAGGCGCAT CGACTGCTGC TGCTGCTCGA CGAGTTTCCC AAACTTGGCA GCCTGCCCTT CCTGGAAAAC GCCATGGGCG AAATGGCCGG CTACGGCATC ACCGCCCACC TGATCTGCCA GAGCTTCAAC GACGTGTTCT CCAAGTACGG GGACAAGACG CCGATCTTCG ACAACATGCA CATCACCGCC ACCTTCGCGA CCTCGGAGCC TACGAGCATC GACAAGGTGA TCCGGCGCGC CGGCAAGGCG CTGGAGATGC GCGAGAGCTA CAGCGATCCG CGCAGCATCT TCGGCAGCTC GCACCGCTCG ACCTCCCAGA GTGAGCACGA GCGCTACATC CTGACCGAGG ACCGGGTCCG CGAGCTGGAC GACGACCAGC AGTTCCTGTT CGTGAACAAC ACCAAGCCGA TCCGGGCGGA GAAGATCCGC TACTACGACG AGCCGTTCTT CAAGGCGCGG ACGGGGGACT ATTTCCACGG CGTGCCCGCC AAGTACGAGC AGCGGCCGGG TACGGCTGAC CTGCCAGGGC CCGCTCAGAT CGACTGGCTT GGGGTTCGCG CGGCAGAGCC GGCCCCGGCC GGCCTGAAGG GTGTCGTGCC GCCGCCTGCG CCCGAAGAGA CGGATGATGG ACCTCAGCCC GCCGGCGACA GCGGGCAGGG CCTCCACGCC CCGGTCTCGA GCCTGCGATG GACTGGCGAC GACGATGACG ACGGCAGCCT TGGCTGA
|
Protein sequence | MSARPKIILP LAAASVLIAL SIATQIVAHD FHYPREFGHG LLDVGQARIY APWAFIGWYG RFAARYQQAF DMAAMIALAA VFVPSMLLIG LTKSTRRAPR EFGKDAWATE AHVRKAKLVH GDGQISGRVL GRFNGKYLTY RGVEHAIIVG ASRSGKGAGH VVPTLIAWPQ SAFVYDRKGE LWHITADHRK TFSHVFYFAP TDPNTVRWNP LFEVRKGPME IADIQNVVGI LVDPLGRKAG DLNFWDQSAT DFFTAIILHV LYSEEDTKKN LAQVRRLLIN IDPTLHAMKH TKHRHRPDLH APGGLARGAD GKPIAEVHPE ILLGATALDS MDERVKSNVL ATCRASLSLW ADPYVEYATS WSDFSIGDLV CSESPVTFYI ITPQAHADRL AFLVRVFTRQ TINSLMEREH FDSRGRRKAH RLLLLLDEFP KLGSLPFLEN AMGEMAGYGI TAHLICQSFN DVFSKYGDKT PIFDNMHITA TFATSEPTSI DKVIRRAGKA LEMRESYSDP RSIFGSSHRS TSQSEHERYI LTEDRVRELD DDQQFLFVNN TKPIRAEKIR YYDEPFFKAR TGDYFHGVPA KYEQRPGTAD LPGPAQIDWL GVRAAEPAPA GLKGVVPPPA PEETDDGPQP AGDSGQGLHA PVSSLRWTGD DDDDGSLG
|
| |