Gene Caul_5331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5331 
Symbol 
ID5897119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp40137 
End bp42113 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content66% 
IMG OID641550623 
ProductTRAG family protein 
Protein accessionYP_001672109 
Protein GI167621601 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.390385 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCC GCCCCAAGAT CATCCTGCCG CTAGCAGCGG CCAGCGTGTT GATTGCCCTA 
TCGATCGCCA CTCAGATCGT CGCTCACGAC TTCCACTACC CGCGTGAGTT CGGCCACGGC
CTGCTGGATG TGGGGCAGGC GAGGATCTAC GCGCCGTGGG CGTTCATTGG CTGGTATGGG
CGCTTCGCGG CGCGCTACCA GCAGGCCTTC GACATGGCGG CGATGATCGC GCTCGCGGCC
GTGTTCGTCC CGTCGATGTT GCTGATCGGG CTGACCAAGA GCACGCGCCG GGCGCCGCGA
GAGTTTGGCA AGGACGCCTG GGCGACCGAG GCCCATGTCC GCAAGGCCAA GCTCGTCCAT
GGTGACGGCC AGATCAGCGG ACGGGTGCTC GGCCGGTTCA ACGGCAAATA CCTCACCTAT
CGCGGTGTGG AGCACGCCAT CATCGTGGGC GCGTCCCGCA GCGGGAAGGG GGCCGGCCAC
GTCGTTCCCA CCCTGATCGC CTGGCCGCAA AGCGCCTTCG TCTACGACCG CAAAGGGGAG
CTTTGGCACA TCACGGCCGA TCACCGGAAG ACCTTCAGCC ACGTCTTCTA TTTCGCGCCG
ACCGACCCCA ACACCGTGCG ATGGAATCCG CTGTTCGAGG TGCGGAAGGG GCCGATGGAG
ATCGCCGACA TCCAGAACGT CGTCGGCATC CTGGTGGACC CGCTGGGCCG AAAGGCGGGC
GACCTCAATT TCTGGGACCA GAGCGCGACG GACTTCTTCA CCGCGATCAT CCTGCACGTC
CTCTACAGCG AGGAGGACAC CAAGAAGAAC CTCGCCCAGG TCCGCCGCCT GCTGATCAAT
ATCGATCCGA CCCTTCATGC GATGAAGCAC ACCAAACATC GCCACAGACC GGACCTTCAT
GCGCCGGGCG GGCTGGCGCG GGGCGCCGAC GGCAAGCCCA TCGCCGAGGT CCACCCGGAG
ATCCTGCTGG GCGCCACGGC GCTGGACAGC ATGGACGAGC GGGTGAAGTC CAATGTGCTG
GCCACCTGTC GGGCGTCGCT ATCGCTGTGG GCCGACCCCT ACGTGGAATA CGCCACCAGC
TGGTCGGACT TCTCGATCGG CGACCTGGTG TGCTCGGAGA GCCCGGTCAC CTTTTACATC
ATCACCCCCC AGGCCCATGC CGACCGCCTC GCCTTTCTGG TGCGGGTGTT CACGCGCCAA
ACGATCAACA GCCTGATGGA ACGCGAGCAT TTCGACAGCC GGGGGCGGCG CAAGGCGCAT
CGACTGCTGC TGCTGCTCGA CGAGTTTCCC AAACTTGGCA GCCTGCCCTT CCTGGAAAAC
GCCATGGGCG AAATGGCCGG CTACGGCATC ACCGCCCACC TGATCTGCCA GAGCTTCAAC
GACGTGTTCT CCAAGTACGG GGACAAGACG CCGATCTTCG ACAACATGCA CATCACCGCC
ACCTTCGCGA CCTCGGAGCC TACGAGCATC GACAAGGTGA TCCGGCGCGC CGGCAAGGCG
CTGGAGATGC GCGAGAGCTA CAGCGATCCG CGCAGCATCT TCGGCAGCTC GCACCGCTCG
ACCTCCCAGA GTGAGCACGA GCGCTACATC CTGACCGAGG ACCGGGTCCG CGAGCTGGAC
GACGACCAGC AGTTCCTGTT CGTGAACAAC ACCAAGCCGA TCCGGGCGGA GAAGATCCGC
TACTACGACG AGCCGTTCTT CAAGGCGCGG ACGGGGGACT ATTTCCACGG CGTGCCCGCC
AAGTACGAGC AGCGGCCGGG TACGGCTGAC CTGCCAGGGC CCGCTCAGAT CGACTGGCTT
GGGGTTCGCG CGGCAGAGCC GGCCCCGGCC GGCCTGAAGG GTGTCGTGCC GCCGCCTGCG
CCCGAAGAGA CGGATGATGG ACCTCAGCCC GCCGGCGACA GCGGGCAGGG CCTCCACGCC
CCGGTCTCGA GCCTGCGATG GACTGGCGAC GACGATGACG ACGGCAGCCT TGGCTGA
 
Protein sequence
MSARPKIILP LAAASVLIAL SIATQIVAHD FHYPREFGHG LLDVGQARIY APWAFIGWYG 
RFAARYQQAF DMAAMIALAA VFVPSMLLIG LTKSTRRAPR EFGKDAWATE AHVRKAKLVH
GDGQISGRVL GRFNGKYLTY RGVEHAIIVG ASRSGKGAGH VVPTLIAWPQ SAFVYDRKGE
LWHITADHRK TFSHVFYFAP TDPNTVRWNP LFEVRKGPME IADIQNVVGI LVDPLGRKAG
DLNFWDQSAT DFFTAIILHV LYSEEDTKKN LAQVRRLLIN IDPTLHAMKH TKHRHRPDLH
APGGLARGAD GKPIAEVHPE ILLGATALDS MDERVKSNVL ATCRASLSLW ADPYVEYATS
WSDFSIGDLV CSESPVTFYI ITPQAHADRL AFLVRVFTRQ TINSLMEREH FDSRGRRKAH
RLLLLLDEFP KLGSLPFLEN AMGEMAGYGI TAHLICQSFN DVFSKYGDKT PIFDNMHITA
TFATSEPTSI DKVIRRAGKA LEMRESYSDP RSIFGSSHRS TSQSEHERYI LTEDRVRELD
DDQQFLFVNN TKPIRAEKIR YYDEPFFKAR TGDYFHGVPA KYEQRPGTAD LPGPAQIDWL
GVRAAEPAPA GLKGVVPPPA PEETDDGPQP AGDSGQGLHA PVSSLRWTGD DDDDGSLG