Gene Caul_4781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4781 
Symbol 
ID5902243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5162388 
End bp5164178 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content73% 
IMG OID641565301 
Productheparinase II/III family protein 
Protein accessionYP_001686399 
Protein GI167648736 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.997993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCG CCCCCGCGAC AGGTTCCGCC CGCAAGCCGA CCGCCCTCGC CGGCGGCCCC 
CAAGGGCGTG GCAAGGCCCA TGGTCGCGGC AAGGCGCCCA AGGGGCTGTT TTTCAAGGCC
TTGGGCGCTT CGATACGCGG CAATCTCGAG CGCGAATGGT TCGGTTCGCC GCCGCATCGC
GCCCTGATCA GCCGCCCGCG CCCCGTCGGC CTGGCCGTCC GTCCGCACGA CCCGCGACCG
GTCGACATCG AGCGCGGCCG CCAACTGCTT GACGGCGTCA TGACCCTGGA CGGCGGCGCG
CTGCGTCTGG GCGAGACCGG CGACCCCTTC GACCAGCCCA GCCCGACCCG TCAGTTCGCG
ACCGCCCTGC ACGGCTTCGA CTGGCTGCCG CATCTGTTGG CCGCCGGTCC CGGCGCGCCA
CGTCTGGCCT TGCGGCTGCT ACAGGACTGG CGGCGGGTGT TCGGGGTGTG GAACGCCTTT
TCATGGAGCG GCGAGCGGCT GGAGCGCCGC ACCTTCCACC TGGCCTGCGC CGCCCGCGCC
CTGTCGCCCG AGGCCTCCGA CGCCGAGATC TCGGCCATGA CCATGGACAT CGCCCGGGGC
GCCCGCCAAC TGCTCAAGGC CTCCGACGCC CCCGACCGCC GGCTGGAACG CGCCGTCGTC
GTGGCCATCG CCGGCTGCGC CCTGACCGGC AAGGCCAGCG ACCGGTTGAT GGCCGCGGGG
CTCAAGCGGG TGGCGGCCGA TATCGAGAGC ATCGTCCTGC CCGATGGCGG CCACGCCAGC
CGCTCGCCCG AGGCGGGGCT GGAGCTGTTG TTCGACCTGC TGACCCTCGA CGACGCCCTG
GGCCAGCGCG GCCGCCCGGC GCCCGAGGCC CTGGGCCGGG CCATCGACCG GCTGAGCTCC
TCGGTGCGGT TCTTCACCCT GGCCGACGGT TGCCTGGCGG CCTTCCAGGG CGGCGAGGCC
GTGGAGCCGC GCCGGGTGGC CGCGGCCCTG GCCCACGACG ACCACGGTCC GCGTCCGCCG
CAAAGCGCGC CGCATGTCGG CTATCAGAAG ATGCAGGGCG GCAGCATCCA GGTGATGGCC
GACGCCGGCC CGCCGGCCAG AGGCGTGCTC AGCGTCTCGG CCTGCGCCCA GCCGGCCGCC
GTCGAGATCG TCTGCGGCAA GGATCGGTTG ATCACCAGCT GCGGCTGGAG CCCGGAGGCC
ACCGGCGCCA ACGCCTTCCG GCTGTCGGAC GCCGCTTCGA CCCTGTCGGT CGGCGACGGC
TCTGCCGGAC GACCGCTGTC GGGCTGGCGA TCCGGCGCCC TGGGTCCCTG GCTGATCGAC
GGCGCGACCG ATGTCGAGAT CAAGCGTCAC GACGCCGACG TCGGGGTGTG GCTGGACATC
GTCCACGACG GCTGGCGCCG CCTGGGCCTG ACCCATGCCC GCCGCCTGTA TCTCGACCTC
AAGGCCGATG AGCTGCGCGG CGAGGACAGC CTGATCCCGC TCCCAGACAA AAATGGCGTC
TCGCCCCATG CCGACGGGCC GCGCCGCTAC CTGCCGTTCA TGATCAGCTT CCACCTGCAT
CCGGACGCCC GCGCCTCGCT GGCCCGCGAT GGCAAGAGCG TGCTGATCAA GGGGCCGTCC
AATGTCGGCT GGTGGCTGCG CAACGACGCC GTCGATGTCG CCATCGCCCC CTCGGCCCAT
TTCGACCACG GTCACGCCCG GCGGGCGGGA ACCATCGTGC TCAGGAGCCA GGTGCGTCCC
GAGAAGGGCG CCAAGATCCG CTGGAAGCTG GCCCGGGCGG CGGATCACTG A
 
Protein sequence
MDGAPATGSA RKPTALAGGP QGRGKAHGRG KAPKGLFFKA LGASIRGNLE REWFGSPPHR 
ALISRPRPVG LAVRPHDPRP VDIERGRQLL DGVMTLDGGA LRLGETGDPF DQPSPTRQFA
TALHGFDWLP HLLAAGPGAP RLALRLLQDW RRVFGVWNAF SWSGERLERR TFHLACAARA
LSPEASDAEI SAMTMDIARG ARQLLKASDA PDRRLERAVV VAIAGCALTG KASDRLMAAG
LKRVAADIES IVLPDGGHAS RSPEAGLELL FDLLTLDDAL GQRGRPAPEA LGRAIDRLSS
SVRFFTLADG CLAAFQGGEA VEPRRVAAAL AHDDHGPRPP QSAPHVGYQK MQGGSIQVMA
DAGPPARGVL SVSACAQPAA VEIVCGKDRL ITSCGWSPEA TGANAFRLSD AASTLSVGDG
SAGRPLSGWR SGALGPWLID GATDVEIKRH DADVGVWLDI VHDGWRRLGL THARRLYLDL
KADELRGEDS LIPLPDKNGV SPHADGPRRY LPFMISFHLH PDARASLARD GKSVLIKGPS
NVGWWLRNDA VDVAIAPSAH FDHGHARRAG TIVLRSQVRP EKGAKIRWKL ARAADH