Gene Caul_5423 details

Gene Information

Locus tag: Caul_5423
Symbol:
ID: 5897197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010333
Strand:
Start bp: 135505
End bp: 136605
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 66%
IMG OID: 641550711
Product: putative transposase
Protein accession: YP_001672197
Protein GI: 167621689
COG category:
COG ID:
TIGRFAM ID:
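
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(135505, 136605)
print(length, protein_length(length))  # prints: 1101 366
```

Both values match the Gene Length (1101 bp) and Protein Length (366 aa) fields of this record.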


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.365485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGTCGTG CGAGTCTGCG GATGCGTACT GGCGTCACCG TCCACCTGAG CCCGACCGAT 
CGCAAGCGCC TGCGGGTGAT CGTCGATGAC GGCAACAGCC CCCAGAAGCA TGTCTGGCGC
GCCAGGATCG TGCTGTGCAC GGCCGACGGG CTTGGCACGG CGGCGATCAT GCGCGCGGCA
GGGGTCAGCA AGACCGCCGT CTGGCGCTGG CAGGAACGCT TCATGGATGA AGGCGTCGAT
GGCCTGCTGC GCGACAAGAC CCGCCCCGCG CGGGTTCAGA AGCTGGCCGA CGAGGTCGCC
GAGCGTATCG TCGCCCTGAC CCTGGGCGAA CCGCCCGGCG AGACCACCCA CTGGACCGGC
CGGGTGATGG CGGGGGTCGC CGGCGTCAGC CTGACCTCGG TGCAGCGTAT CTGGAAGGCC
CATGGCCTGG CCCCGCATCG CATCCGCACC TTCAAGCTCT CCAATGATCC CAGGTTCGCC
GCCAAGGTCC GCGACATCGT CGGCCTGTAT GTCGATCCCC CCGCCCACGC CGTGGTGCTC
AGCGTCGATG AGAAGTCGCA GATTCAGGCG CTGGACCGCA CCCAGCCGGG GCTGCCGCTG
AAGAAGGGGC GGGCTGGAAC CATGACCCAT GATTACAAGC GACACGGCAC GACCACCCTG
TTCGCCGCCT TCGACGTGCT GGAAGGCAAG GTCATCGGCC GCTGCGTGCA GCGCCACCGG
CATCAGGAGT TCATCCACTT TCTGAACGCC GTCGAGCGCG AGGTCCCGGC CGGAAAGACC
GTCCACGCCA TCCTCGACAA CTACGCCACC CACAAACACC CCAAGGTGAT CGCATGGCTG
GGCCGACATC CGCGCTGGAC GTTCCACTTC ACGCCCACCT CGGCCAGCTG GATCAACGCC
GTCGAGGGCT TCTTCGCCGT CCTCACCAAG CGCCGCCTCA AGCGCGGCGT CTTCAAGGGC
GTCGTCGATC TGCAGGCAGC CATCAACCGC TTCGTCGCCG AGTACAATCA GCATCCAAAG
CCCTTCGTCT GGACCGCCGA TCCAGACAAA ATCATCGCCG CAGCGAACCG TGGGCACCAA
ACGTTGGAAT CAATCCACTA G
 
Protein sequence
MGRASLRMRT GVTVHLSPTD RKRLRVIVDD GNSPQKHVWR ARIVLCTADG LGTAAIMRAA 
GVSKTAVWRW QERFMDEGVD GLLRDKTRPA RVQKLADEVA ERIVALTLGE PPGETTHWTG
RVMAGVAGVS LTSVQRIWKA HGLAPHRIRT FKLSNDPRFA AKVRDIVGLY VDPPAHAVVL
SVDEKSQIQA LDRTQPGLPL KKGRAGTMTH DYKRHGTTTL FAAFDVLEGK VIGRCVQRHR
HQEFIHFLNA VEREVPAGKT VHAILDNYAT HKHPKVIAWL GRHPRWTFHF TPTSASWINA
VEGFFAVLTK RRLKRGVFKG VVDLQAAINR FVAEYNQHPK PFVWTADPDK IIAAANRGHQ
TLESIH
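
The protein sequence follows from the gene sequence under translation table 11, and the GC content field can be recomputed from the same bases. The sketch below is illustrative only: it builds the codon table from the standard NCBI amino-acid string (codons ordered over the bases T, C, A, G), and it assumes that the first codon is read as Met when it is one of the alternative initiation codons commonly listed for table 11, which is why the leading TTG appears as M. The names `translate` and `gc_content` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (standard code, shared by table 11).
AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AAS)}

# Assumption: initiation codons accepted for table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiation codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
print(translate("TTGGGTCGTG CGAGTCTGCG GATGCGTACT"))  # prints: MGRASLRMRT
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content field listed in this record.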