Gene Caul_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3802 
Symbol 
ID5901264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4117831 
End bp4120149 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content60% 
IMG OID641564324 
Producthypothetical protein 
Protein accessionYP_001685426 
Protein GI167647763 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0133094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00280043 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGGCCG AGAGCGCGCA AATCGAACTT CCGGCACAGC ACGAGCGTCA GGCGCACGGT 
CCTTGGAGCG ACCTAGCGCT CCATACCATC GGTTGGCGCG CGTTTCAGGA TCTCTGCTCG
CAGGTATGCG AGGTCGTGCT CGGCCAGCCC GTGGAAATCT TCCGCGAAGC TCAGGACGGT
GGGCAGGACG CGGTTTTTCT CATCCCCTCA GGAAGCGACG CGCCGCCGAT CGGTACGGTC
CAGTGCAAGC ATACGTCGGA GGCCGCAAAG GCCCTGAAGG CGAGCGATCT CACCGCCGAG
ATCGATAACG TCGAAGAGCT GGTGAAGGCC GGCCAGGCAG ACACCTACGC CTTCATGACC
AATATGAGCG TGGATGCACC CGTCGCGGCC GCCATGCGCG CCCGGCTTCG CGCGCTTGGC
GTGCGCAAGC CGCACATTCT CGGCCGCCAG TACATCGTTC GGGTCATCAA GAGCAGTGCG
CGCCTTCGTG CGCTTGTCCC GCAGGTTTAC GGCCTTGGGG ATCTAACATC GATCGTCGAT
GAGAGGCTCA GCGAACAGAG CCGTGCGCTG CTCGACAGCT GGATTCCGAA ACTCCGCACC
TACGTTCCCA CTAAAGCCCA CCGCGACGCG ATCAACGCGA TTTCCAACCA TGGCGTGGTG
CTGCTGCTCG GCAATCCGTC CAGCGGTAAG TCTGCGATTG GGGCCATCAT CTCGACAATC
GCTTCGGAAA ACCCCGCCAA CACCGTCCTC GCTTTGACCA GCCCTCGCGA TTTCGAGGCG
GGCTGGAATC CAAACGACCC GGGCCGCTTC TTCTGGATCG ACGACGCTTT CGGCTCGAAT
GTGCTGCGCA ACGATTTTGT GCAGGACTGG ACGTCGGCCT TCTCGAAGCT GAGGGCGGCG
ATCAAGCACG GTAACCGATT TCTCCTGACC TCCCGCAAGC ACATCTACGA AGCGGCGCGA
CGCCGGCTGG GGCAGCGCAA CCTTGCGCAG TTCGCTGATG GTAGCGCTGT TGTCGATGTC
GGCGAGCTGA TCTTTGAGGA GAAGGCGCAG ATCCTCTACA ACCATTTGAA TTTTGGCGAG
CAGAGCCAAA GCTGGCGTTC AACCGTCAAG CCCCACCTCG CCGCTGTCGC TGCTGTTCGC
GACTTCCTCC CCGGCATCGC CGAGCGTCTC GGCGACCCGA ACTTCACCAA GGGCTTGGCG
CCGCGCGAAA GTTCCCTCGT TCGATTTATG GAGGAGCCGA CGGAACATCT GATCGACACC
GTCAACGCCT TGGACGATCA GCTGCAAGCC GCGCTCATCC TTGTTTATGT TCACCAGACT
GGGTTCGATC CTAGCGATTA CGATGCTTCG GCCGCACAAG CGGTCGCAGA ACTGACTGGC
TACACTCTCA CCAAGATTCA GGATTGCTTC GCCGAGCTGA AAGGCTCGTT CCTGAAACTC
TCCGGTTCAA AATGGACTTT CGCACATCCC ACGATCTCCG ACGCCCTGAC CGAGATTCTG
CGCCAGAAGC CACATATGAT GGCGGCGCTC ATAAGGGGCG CGACTATCGA CACCATTCTC
AGCAGTTTTA CGTGCGAGGG GTCGCCTCTT ATTCGAGACG CCCTTCTCAT ACCCGCAACA
CTCGACGACG CTTTGGTCGC TCGGCTCGGC CACACACCAG ATGAATGGCA CCGCAATTGG
ATGCTGTTCC ACTTTTTGTC TTATCGCGCC AACGAACACG TGTTCGTCAG TGCAGTTCAA
CAATTTCCGC AACTACTTCG GCGGTCCTCC TGGGACTCCG ATCTGGTTAG CAACGATCCT
CATGTCGCCA CATACGCGCG TGCCCATCGC CTCAACCTGT TGCCCGACGA CCTGCGCTCG
GAGGCGGCGA ACAAGCTAGA ATCTGCCGTC CTCAACGATC TCGACGTCTC CTTCTTCGAC
GAGCCGGAGA TGTTGGCTCT GATACCGCCG CTGAGCCTTA TCGGCGTTGG CTTGGCGTTG
CGGACGACTG TGCTGCCGTC GCTTGAAGAG CGGATCGCCG AGATCGCCGC GGATGCCGAT
CTGGACGAAG AGCCTGACAG CCACTTCAAG AAGCTTCTCG GCGTGCTTGA TTGCGTAGAG
GCGATCGGCA TCGACGCGGA CTCCACCGTC TTGATCGATG ACACACGCGA TCAGGTGAGA
CGGTCAATCA AGGCACTTGA AGAGCGCAAG CGGGAGCGCG ACGAAGAGTC CGACGACGAC
ACAGATTGGA CTCACATCGT AACGCAGAAG AAGGATGATA CCCCCGCGCC ACCTGCTGCC
GCCACGAAGC GCTCAGTGTT CGATGATGTC GATAAATAG
 
Protein sequence
MTAESAQIEL PAQHERQAHG PWSDLALHTI GWRAFQDLCS QVCEVVLGQP VEIFREAQDG 
GQDAVFLIPS GSDAPPIGTV QCKHTSEAAK ALKASDLTAE IDNVEELVKA GQADTYAFMT
NMSVDAPVAA AMRARLRALG VRKPHILGRQ YIVRVIKSSA RLRALVPQVY GLGDLTSIVD
ERLSEQSRAL LDSWIPKLRT YVPTKAHRDA INAISNHGVV LLLGNPSSGK SAIGAIISTI
ASENPANTVL ALTSPRDFEA GWNPNDPGRF FWIDDAFGSN VLRNDFVQDW TSAFSKLRAA
IKHGNRFLLT SRKHIYEAAR RRLGQRNLAQ FADGSAVVDV GELIFEEKAQ ILYNHLNFGE
QSQSWRSTVK PHLAAVAAVR DFLPGIAERL GDPNFTKGLA PRESSLVRFM EEPTEHLIDT
VNALDDQLQA ALILVYVHQT GFDPSDYDAS AAQAVAELTG YTLTKIQDCF AELKGSFLKL
SGSKWTFAHP TISDALTEIL RQKPHMMAAL IRGATIDTIL SSFTCEGSPL IRDALLIPAT
LDDALVARLG HTPDEWHRNW MLFHFLSYRA NEHVFVSAVQ QFPQLLRRSS WDSDLVSNDP
HVATYARAHR LNLLPDDLRS EAANKLESAV LNDLDVSFFD EPEMLALIPP LSLIGVGLAL
RTTVLPSLEE RIAEIAADAD LDEEPDSHFK KLLGVLDCVE AIGIDADSTV LIDDTRDQVR
RSIKALEERK RERDEESDDD TDWTHIVTQK KDDTPAPPAA ATKRSVFDDV DK