Gene Caul_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0820 
Symbol 
ID5898275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp877932 
End bp881363 
Gene Length3432 bp 
Protein Length1143 aa 
Translation table11 
GC content68% 
IMG OID641561301 
Productindolepyruvate ferredoxin oxidoreductase 
Protein accessionYP_001682449 
Protein GI167644786 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCACT CGGAAGTCAC GCTTGACGAC AAATATCTGC TCGAAGATGG CCGCGCGTTC 
ATCACTGGCG TGCAGGCGCT GATCCGAGTC CTGCTCGATC GCAAGCGTCT GGACACCCGC
GCCGGACTCA ACACCGCCGG CTTCCTGTCC GGCTATCGCG GCTCGCCTCT CGGGGGATTG
GACCAGCAGG CCGCTCGCCT GAACAAGCTG CTGACCGCCC ACGACGTGGT CTTCAAGGAA
GGCCTCAACG AGGATCTGGC CGCCACCGCC GTGTGGGGCA GCCAGCAGGC CAATCTGTTC
CCGGGCGCCA CCCACGATGG CGTGTTCGGC ATGTGGTACG GCAAGGCGCC CGGCGTCGAC
CGCACCGGCG ACGTCTTCAA GCACGCCAAT TTCGCCGGCT CCTTCCCGAC CGGCGGCGTG
CTGGCGGTGG CCGGCGACGA CCATGCCTGC AAGAGCTCGA CCCTGCCCTC GCAATCGGAG
TTCGCCTTCC AGGACTTCGA AATGCCGGTG CTGTCGCCCG CCGACGTGCA GGAGGTGCTC
GACTACGGCC TGCTGGGCAT TTCGATGTCG CGGTTCTCCG GCCTGTGGAC CGGCATGATC
GCCCTGGCCG ACACCATGGA CTCAGGCGTG ACCATCGACG TCTCGCTGGA CCGCCACAAG
CTGGTGATCC CGGAGAACTT CCCCATGCCC GCTGGGGGAC TGGGCATCCG GCTGAAGGAC
CAGCCGATGG AGAAGGAGCG CCGGCTTCGT CTGCACAAGA TCCCCGCCGC CCTGGCCTTC
GCCCGCGCCA ACGGCGTCGA CCGGGTGGTG CTGGGCGCGC ACCATGTCCG CGTCGGCAAG
GCGCGGCTGG GCATCGTCTG CCAGGGCCAG GCCTACAAGG ATGTGCTGGA AGCCTTCACC
GCCATGGGCA TGAGCCTGGA GGAGGCGTCG GACCTGGGCG TCTCGATCTA CAAGGTCGGC
ATGCCCTGGC CGCTGGAGCC GTTGGGCATC CGCGCCTTCG CCGCCGGTCT CGAGACCCTG
ATGGTCATCG AGCACAAGCG CGGGCTGATC GAGCCGCAGG CCCGCTCGGC GCTCTACGAC
CTGCCGGCCC ACGCCCGTCC GCGGATCATC GGCAAGCATG ACGAGCAGGG GCATCCGCTG
CTGTCGGAGC TGGGCTCGCT GTCGGTGGCC GAAATCGCCC TGGCCATCTA TGACCGCCTG
CCGCCGGGCG CGCACATGGA GCGCGCCCAG GCCTATCTGA ACCGTGTTTC GGCCGCCGGC
GTCGCCGCCG TCAGTCTGGC CGCCGACCAG CAGCGCAAGC CGTTCTTCTG CTCGGGCTGC
CCGCACAACA CCTCGACCAA GCTGCCCGAG GGCAGCCGCG CCCTGGCCGG CATCGGCTGC
CACTACATGG CCAGCTTCAA CGACCCCTCG ACCGACCTGA ACACCCACAT GGGCGGCGAG
GGCCTGACCT GGGTGGGCGC CGCGCCGTTC ACCACCGAGA AGCACGTCTT CCAGAACCTG
GGCGACGGCA CCTACAACCA CTCCGGATCC CTGGCCATCC GGGCGGCGAT CGCGGCCAAG
GCCAACATCA CCTACAAGCT GCTGTTCAAC GACGCCGTCG CCATGACCGG CGGCCAGCGG
GCCGAGAGCG GCTTCACCCC CGCCCAGATC ACCCGCCAGT TGGCGTCCGA GGGCGTCACC
AAGACGGTGA TCGTGGTCGA CGAGCTGGAA CGCTACGAAG GGGTCACGGA TCTCGCCCCC
GGCGTCGAGG TCTTCCCGCG CAACGAACTG ATGATCGTGC AGAAGATGCT ACGCGACACC
GAGGGCACCA CGGTCCTGCT CTACGACCAG ACCTGCGCCA CCGAGAAGCG CCGCCGCCGC
AAGCGCGGGA CGATGGCCAA GGCGACCAAG CGCGTGTTCA TCAACCCGCT GGTCTGCGAG
GGCTGCGGCG ACTGCTCGAT CAAGTCCAAC TGCGTGTCGG TGGAGCCGCT GAACACCGAG
TTTGGCCGCA AGCGCAAGAT CAACCAGTCG TCCTGTAACC AGGACTACAG CTGCGTCGAG
GGCTTCTGCC CATCGTTCAT CACCCTGGAA GGGGCCGAGA GCGCCCAGGC CAAGAAGGTC
CCGGCCCTGA CGGCCGACTC CACGCCCCTG CCGGTGTTCG ACGAGTTCCA CGGCGTGCGG
AAGATCATCT TCACGGGCGT CGGCGGCACC GGCGTGACCA CCGTGGCCTC GATCCTGGCC
ATGGCCGCCC ACGTCGACGG CCGGGCCGGC AGCGTGGTCG ACATGACCGG TTTGGCCCAG
AAGGGCGGCT CGGTGTTCAG CCACGTCAAG ATCGGCGAGA CCGAGGAGAC CGTGGTCGGC
GGTCGCGTGC CCGCCGCCAG CGCCGACGTG CTGATCGCCT GCGACATGCT GGTGGCCGCC
TCGCCGGAGG GCCTGTCGCT GTACGCCAAG GACCGCACCA GCGCCTTTGG CAACAGCGAC
TTCGCGCCCA CCGCCGACTT CGTCACCAGC CGCGATATCC GCTTCGACAG CGGGGCCATG
GCTCGCCGGA TCAAGGGCGC GACCAAGAGC TTCGACGCCT GCCCCGCACA GCACCTGGCC
GAGACCCAGT TCGGCGACGC GATCTTCGCC AACATGATCA TGGTCGGCTT CGCCTGGCAG
CGCGGGGTGA TCCCGCTGTC CAGCCGCGCG GTCTATCGGG CCATCAAGCT GAACGGCGTG
GATTACGAGT CAAACCTGGC CGCCTTCGAA CTGGGCCGCC GCGTCGCCCA CGACCCCGCG
TCGATGGGGC CGCGCGATGC GGACGTTCCA ACGCCCGAGA CGATGCCGCT GGAAGACCTG
ATCGCCAAGC GCGCCGCCGA CCTGATCGCC TACCAGAACG AGGCCTACGC CAAGCGCTAC
CTGGCGCGGA TTGAGAAGGT CGCGGCGGCC GAGGCGGCCC ACGGCGGCGG CGAGTCCCTG
ACCCGCGCGG CGGCGGTCAA TCTCTACAAG CTGATGGCCT ATAAGGACGA GTACGAGGTC
GCCCGTCTGT ACACGGACGG ACGCTTCGCG GCCGAACTGG CCGGGACCTT CAAGGGCGGC
AAGGCCAAGG TCTGGCTGGC CCCGCCGATC ATCGGCGCCA AGAACAAGGA CGGCACGCCG
CGCAAGATGG CCTTCGGCGG CTGGATGCTC GACTACGCTT TCCCGGTGAT GGCCAAGATG
AAGGGCCTGC GCGGCGGGCC GCTCGACGTG TTCGGCGCCA CCGAGGAACG CCGCATGGAG
CGCGGCCTGA TCGCCGACTA CGAGATCACT CTCGACCGCC TGGTCGGCGG CCTGACGCCC
GAGCGCTTGC CATTGGCGGC CCGCATCGCC GCGATCCCGC AAGAGATCCG CGGCTACGGC
CACGTCAAGG ACGCCTCGGT GGTCAAGGCC AAGGCCGAAG CGACGACCTT GTGGTCCCAG
TGGGAGGGGT AG
 
Protein sequence
MRHSEVTLDD KYLLEDGRAF ITGVQALIRV LLDRKRLDTR AGLNTAGFLS GYRGSPLGGL 
DQQAARLNKL LTAHDVVFKE GLNEDLAATA VWGSQQANLF PGATHDGVFG MWYGKAPGVD
RTGDVFKHAN FAGSFPTGGV LAVAGDDHAC KSSTLPSQSE FAFQDFEMPV LSPADVQEVL
DYGLLGISMS RFSGLWTGMI ALADTMDSGV TIDVSLDRHK LVIPENFPMP AGGLGIRLKD
QPMEKERRLR LHKIPAALAF ARANGVDRVV LGAHHVRVGK ARLGIVCQGQ AYKDVLEAFT
AMGMSLEEAS DLGVSIYKVG MPWPLEPLGI RAFAAGLETL MVIEHKRGLI EPQARSALYD
LPAHARPRII GKHDEQGHPL LSELGSLSVA EIALAIYDRL PPGAHMERAQ AYLNRVSAAG
VAAVSLAADQ QRKPFFCSGC PHNTSTKLPE GSRALAGIGC HYMASFNDPS TDLNTHMGGE
GLTWVGAAPF TTEKHVFQNL GDGTYNHSGS LAIRAAIAAK ANITYKLLFN DAVAMTGGQR
AESGFTPAQI TRQLASEGVT KTVIVVDELE RYEGVTDLAP GVEVFPRNEL MIVQKMLRDT
EGTTVLLYDQ TCATEKRRRR KRGTMAKATK RVFINPLVCE GCGDCSIKSN CVSVEPLNTE
FGRKRKINQS SCNQDYSCVE GFCPSFITLE GAESAQAKKV PALTADSTPL PVFDEFHGVR
KIIFTGVGGT GVTTVASILA MAAHVDGRAG SVVDMTGLAQ KGGSVFSHVK IGETEETVVG
GRVPAASADV LIACDMLVAA SPEGLSLYAK DRTSAFGNSD FAPTADFVTS RDIRFDSGAM
ARRIKGATKS FDACPAQHLA ETQFGDAIFA NMIMVGFAWQ RGVIPLSSRA VYRAIKLNGV
DYESNLAAFE LGRRVAHDPA SMGPRDADVP TPETMPLEDL IAKRAADLIA YQNEAYAKRY
LARIEKVAAA EAAHGGGESL TRAAAVNLYK LMAYKDEYEV ARLYTDGRFA AELAGTFKGG
KAKVWLAPPI IGAKNKDGTP RKMAFGGWML DYAFPVMAKM KGLRGGPLDV FGATEERRME
RGLIADYEIT LDRLVGGLTP ERLPLAARIA AIPQEIRGYG HVKDASVVKA KAEATTLWSQ
WEG