Gene Caul_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2071 
Symbol 
ID5899526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2210873 
End bp2213884 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content67% 
IMG OID641562560 
ProductSMP-30/gluconolaconase/LRE domain-containing protein 
Protein accessionYP_001683697 
Protein GI167646034 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCGA TCCTATTGTC GGCGATCCTG GCGGCCTCGA CCCCCGCCGT GGCGGCGACC 
GCCTCCAACT CCGTCCTGCT GACGGCGCCG GACGATCCGC GCGCGATCAT GGTGAAAGGC
GCGGGCGATG GCCGCGCGGA CGACACCGAG GCCGTTCAAC ACGCCATCGA CGCCGCCCGC
GACAAGACTG GCCATGGCGT CGTCTTCCTG CCGTCGGGGC GCTATCGCCT GAGCCGCAGC
ATCCTGGTGC CGCCGGGCGT GCGCATCTTC GGTGTTGGGG CCAAGCGGCC TATCTTCGTG
CTTGGTCCGA ACACGCCGGG ATTCCAGAAG GGCGTGGGCA CGATGATCGT GTTCACGGGC
GGCGATCAGT ACGCCGTCGG CAAGATCCCC GTGCCGGTTC CGACCGCGGT GGGCGCCAAG
GGCGATGTGC GCGACGCCAA CTCCGGGACC TTCTATTCGG CGCTGAGCAA TGTCGATATC
GAGATCGGCG CAGGCAACCC GGCCGCGGCG GGGGTTCGGT TCCGCATGGC CCAGCACGCC
TTCCTCAGCC ACATGAACTT CAACCTGGGT ACGGCCTTCG CCGGCGTCTA CCAGGCCGGC
AACGTCATGG AGGACGTCCA CTTCCACGGC GGTCGCTATG GCATCGTCAC CGAAAAGACC
TCGCCGGCCT GGCAGTTCAC TCTGATGGAC TCCACCTTCG ACGGCCAGCG CGACGCGGCG
ATCCGCGAGC ACGAGGTGGA TCTGACGCTG GTCAATGTGG CGATGCGCAA CACGCCCGTC
GGCATCGAGA TCGACCGCGG CTACAGCGAC TCCCTGTGGG GCAAGAACGT CCGCTTCGAG
AACGTCTCCA AGGCCGCGGT GGTCATTTCT GCTGAAGACA ACGTCTTCAC CCAGGTGGGC
TTCGAAAACG CCGTGGCGTC CAACACGCCG GTGTTCGCCC GCTTCCGCGA CAGCGGCAAG
ACCGTGGCCG GCAAGGGCCG CGCCTATCGG GTGGCCTCGT TCACCTACGG CTTGACGCTG
CCGGGCCTTG GCCAGATGGG CGAGTACAAG ACGCAGGCCG ATATCACCAC CCTGCCGGGC
TTGTCGAAAG CCGCCGCGCC GGCCATCCGC CCGCTGCCGC CGGTGAGCGC GTGGACCAAT
GTCAAGACGC TCGGCGTCAA GGGCGACGGC GAGACCGACG ACACCGCCGC GTTGCAGGCG
GCGATCGAAA CCCACCGCGT GCTCTACCTG CCGATCGGCT TCTACAAGAT CACCGACACG
CTGCGGCTTC GGCCGGACAC CGTGCTGATC GGCCTGCATC CGGCCATTAC CCAGCTTGTC
CTGCCCGACA ACAATTCCCG CCATGCCGGG GTCGGCGCCG TCGCGCCGAT GATCGAAACG
CCGAAGGGCG GCGACAACAT CCTGGCCGGG ATCGGGCTGT TCACGGGACG CGTCAATCCG
CGCGCCTCGG CGCTGCTCTG GCGGTCGGGC GAGACCTCGC AGGTCACGGA CGTCAAGATC
ATGGGCGGCG GCGGCACGCC GACCATGGAT GGCAAGCCGC TGGGCGCGGC CCAGGCGCGC
AGCGGCGATC CGGTGGCCGA CGGCCGCTGG GACGGGCAAT ATCCCAGCAT CTGGGTGACT
GATGGCGGCG GCGGGACCTT CGCCAATGTC TGGAGCCCCA ACACCTTCGC CTCGGCCGGA
TTCTACATCT CCGACACCAG CACGCCGGGC CACATCTACG AGGTGTCGGT CGAGCACCAC
GTCCGCAACG AGTTCGTGCT CGACAACGTC CAGAACTGGG AATTCCTGGC GCCCCAGACG
GAGCAGGAGG TCGGTGACGG GCCGGACGCC GTGTCGCTGG AAATCCGCAA CTCGCGCAAC
ATCCTGTTCG CCAACTATCA CGGCTATCGG GTGACGCGGT CGTACCACCC CGCCGAAACG
GCTGTGAAGC TGTTCAACTC GTCCGACATC CGCTTCCGCA ATGTCCATAT CAACGCCGAG
AGCGGCGTCG CGCTGTGCGG CGCGAGCGGC TGCGGGACCT ATCTGCGGGC CAGCAAGTAT
CCCTTCGAGA ACGCGATCCA GGACAAGACC CGCAAGCTGG AGGTGCGCGA ACGTGAGTTC
GCGGTGCTCG ACGTCACCAG CGAGCCCGGG CCCGCCGCGC CGGCGCGCGA CGAGAAGGTC
CGGAAGCTGG AGACCGGCTT CTGGTCGATC TCGGGCGCGA CGGTGGGCGC GGACGGCGCG
CTCTATTTCG TGGAGAAGCG GTTCCAGCGG ATCTATCGGT GGACGCAGGC TCGGGGTCTC
GAGGTGGTGC GCGACCAAGC GCTCGACCCG GTGAACCTGG CGGTCGACCG TTCCGGCAAG
CTTCTTGTGC TCTCCTCTTA CGGTCCAGAG GGAAGCGTCT ATTCGATCGA TCCGGCTGGA
CCCAAGGACC AACTGACGAT GATCGCGGCG ACCCCGTCCG CGTCGCGTCC AGACGCCAAG
ACGCTGCTGC CGGTCAACTG GTGGAACAAC GGCGAGTTCA GGGACCAATA CGATCCGTCG
ACCGACCATT TCACCACCCT GGCCGAAATG TTCGCCCGCG ACGCCGGCGC GCCCAAGGAC
AAGTCCTACG TCTCGCCTGA CGGCAGCCTT GTCCTGCCGG CCTTCCGTGT GTGGGAGCAA
GGTCCTCCCG ATCACGTCGG CTGGCGCTGG TCCGACACCC TGCAGACCCA TGGCCTCATC
GCGGCTGAAC CCGGCGAGCG GGTGTTCGTC ACCAACGAGT CCGAAGGCAA GACCTATAGC
GGCAAGGTCG GTCCCGGCGG CGCCCTGACG GACCTGAAGG TGTTCGCCAA CCGGGGCGGG
GAGAGCGTCG CGGTCGATGA CCAGGGACAG GTGTTCGTCG CCAATGGCCA GATCTTCAAA
TATGCGGCCG ACGGCCAGCC CACCGGGCGC ATCGACCTGC CCGAGCGGCC GCTGCAGCTG
ATCTTCGGCG GCCAGGACCG GCGCACGCTG TTCGTCCTGA CCCACCACTC GCTCTATGCG
GTGACGCCAT GA
 
Protein sequence
MRAILLSAIL AASTPAVAAT ASNSVLLTAP DDPRAIMVKG AGDGRADDTE AVQHAIDAAR 
DKTGHGVVFL PSGRYRLSRS ILVPPGVRIF GVGAKRPIFV LGPNTPGFQK GVGTMIVFTG
GDQYAVGKIP VPVPTAVGAK GDVRDANSGT FYSALSNVDI EIGAGNPAAA GVRFRMAQHA
FLSHMNFNLG TAFAGVYQAG NVMEDVHFHG GRYGIVTEKT SPAWQFTLMD STFDGQRDAA
IREHEVDLTL VNVAMRNTPV GIEIDRGYSD SLWGKNVRFE NVSKAAVVIS AEDNVFTQVG
FENAVASNTP VFARFRDSGK TVAGKGRAYR VASFTYGLTL PGLGQMGEYK TQADITTLPG
LSKAAAPAIR PLPPVSAWTN VKTLGVKGDG ETDDTAALQA AIETHRVLYL PIGFYKITDT
LRLRPDTVLI GLHPAITQLV LPDNNSRHAG VGAVAPMIET PKGGDNILAG IGLFTGRVNP
RASALLWRSG ETSQVTDVKI MGGGGTPTMD GKPLGAAQAR SGDPVADGRW DGQYPSIWVT
DGGGGTFANV WSPNTFASAG FYISDTSTPG HIYEVSVEHH VRNEFVLDNV QNWEFLAPQT
EQEVGDGPDA VSLEIRNSRN ILFANYHGYR VTRSYHPAET AVKLFNSSDI RFRNVHINAE
SGVALCGASG CGTYLRASKY PFENAIQDKT RKLEVREREF AVLDVTSEPG PAAPARDEKV
RKLETGFWSI SGATVGADGA LYFVEKRFQR IYRWTQARGL EVVRDQALDP VNLAVDRSGK
LLVLSSYGPE GSVYSIDPAG PKDQLTMIAA TPSASRPDAK TLLPVNWWNN GEFRDQYDPS
TDHFTTLAEM FARDAGAPKD KSYVSPDGSL VLPAFRVWEQ GPPDHVGWRW SDTLQTHGLI
AAEPGERVFV TNESEGKTYS GKVGPGGALT DLKVFANRGG ESVAVDDQGQ VFVANGQIFK
YAADGQPTGR IDLPERPLQL IFGGQDRRTL FVLTHHSLYA VTP