Gene Caul_0981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0981 
Symbol 
ID5898436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1037379 
End bp1038950 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content66% 
IMG OID641561463 
Producthypothetical protein 
Protein accessionYP_001682609 
Protein GI167644946 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.478644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTCG CGCGGGCGCA GAGGCGCATT TGGAAGGTGC CCGCCATGTC CATCATCATC 
CTCAATGACC CGACCTACGG CGTCGTCCTG TTCGGCGACA CCCAAGACGA GTTGATCGCC
AGTCAGGTCG GCCTCTCGCA AACGCTGGCC GAGACTTACA CCGTGGACGA GCACGGGGAC
AGCATCCCCC TCGATTACCA GGTGGGCGAC GCCTCCGGCA TCCGGGGGAA CGCGCTAGGC
GGTGACGATC ATATCACCAG CTATGGTTCC GCGGTTGGAG ACGCCTTTAC GCTGGCCGAA
CACGGGCGAG GCGGGATGGA CACCATCTTC GCCCACTCCG GGCACGGGTC GGCGTTTGGC
GACGCCCTGA CCATGACCGG TCACGCCAGC GGCGGTGACG ATCTGATCAC CATCTTCAAC
GACTCCTTCG GGGGTTGGGA TGGTTCGATG TTCGGCGACG CCAAGCTGAT GACGGATGAC
GCCCGGGGCG GCAACGACAC ATTGAAAGGG TTGTCCGACC ACCTCAGCGA CGAGGTCGTG
CTCTATGGCG ACGCCCTTGA GATGAACGGT CGCGCCCAGG GGGGCGACGA CATTCTCGCC
GGCGACATGT GGACGTCCAA CTACCAATAC GGAGACGCCC AAACCCTGTC GGAGCAGGCG
CGGGGCGGCA ATGATCGGCT GTTCGGCGGG GACTATTCAT GGACCGAGCT TAACGGCGAC
GCCTATCTGC TGACCGACAA CGCGGTGGGC GGCAATGACC TGATCACCGG CGGAAGCGCC
TACGACGTGT CTGAGGGCGC CAATGATATG CTCGGCGACG GCTACCAGCT CGCGGGCCAC
GCCATCGCCG GAGACGACGT GCTGATCGGC GGGAGGGGCG ACAGCAACAC CATGTGGGGT
GATGGAGTTC TGATCGGGCC GGACGTGACC CGCGGCCACA ACAGGTTCGT GATCTCCCCG
TCCGGCGAGA TCGACACCCT CAAGGACTTC AATCCCGGCC ACGACCAGAT CGTGCTGGCG
GGGTTCACCT ACACCGCGTT CGCCGACATC GCCGGCGCCA TTCACCCCAC GGACACGGGC
GTGCAGATCG ATCTTGGTGC TGACGGCCTC GTCATCGTCG AGGGCGTGAC CCAGCTCACC
GCCGCCGATG TGACGTTCGA CGCCAACGCC CGCAAGGTCG CGGGCGGGTC GCACAATGAC
GTGCTCACAG CGGCGGGCGG CAACAACGCC TTCCATGGCG GCCTGGGGGA CGACACCTTC
ATCATCCAGG CGGTGGGCCT GGCCAACTCG GTCGGCGGCT CCACCCAGGG GGTCGGGGCC
GACAGCGTCA TCTGGGACTT CGCCGGGGCT GGCGGCACGC CCGCCGGCGC GAACGACCTG
CTCCAGCTCC AGGGCTTTGG CCCGGGCTCC ACCCTGACCT TCCTCCGCTT CGGCGGCTTG
CGCGCGGGCG GACCCGACCC AACCCTGCAG TACTATTCCG TGCACGACAC CATGGGCGGA
CCCAACCACG TGCTTTTCGT CCATTCGTTG AACGGCCAGT TGCTGACCTC GGCGGACTAC
GGCTTCATCT GA
 
Protein sequence
MAVARAQRRI WKVPAMSIII LNDPTYGVVL FGDTQDELIA SQVGLSQTLA ETYTVDEHGD 
SIPLDYQVGD ASGIRGNALG GDDHITSYGS AVGDAFTLAE HGRGGMDTIF AHSGHGSAFG
DALTMTGHAS GGDDLITIFN DSFGGWDGSM FGDAKLMTDD ARGGNDTLKG LSDHLSDEVV
LYGDALEMNG RAQGGDDILA GDMWTSNYQY GDAQTLSEQA RGGNDRLFGG DYSWTELNGD
AYLLTDNAVG GNDLITGGSA YDVSEGANDM LGDGYQLAGH AIAGDDVLIG GRGDSNTMWG
DGVLIGPDVT RGHNRFVISP SGEIDTLKDF NPGHDQIVLA GFTYTAFADI AGAIHPTDTG
VQIDLGADGL VIVEGVTQLT AADVTFDANA RKVAGGSHND VLTAAGGNNA FHGGLGDDTF
IIQAVGLANS VGGSTQGVGA DSVIWDFAGA GGTPAGANDL LQLQGFGPGS TLTFLRFGGL
RAGGPDPTLQ YYSVHDTMGG PNHVLFVHSL NGQLLTSADY GFI