Gene Caul_4255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4255 
Symbol 
ID5901716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4624046 
End bp4625722 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content49% 
IMG OID641564774 
Producthypothetical protein 
Protein accessionYP_001685874 
Protein GI167648211 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.669454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCCA AAACGAAAAC GGAACGTCTT GGGAGATTGA TATCTCACGG ATATTTTGCG 
CCTGAGCTGC CTCCATGCTT CGTATCAGAT CGATTCGCTC GATATCGAAA ATCGATCATC
TCCGCGATCG AGGCCCTCCC TCAAGAGCGA GGAAAGCCGG CATTTCATCG CTTCATTTCG
GAACCTAGTT GGTTTTATTT TCCAAGATAT GGGAAAGATG ATCGCCGCCA CGGGGTGCCC
AATCCTATTG CCCACCTCCT ACTTTCTCGA ACGATCTCTG ACAACTATGT TGATCTCAGA
AGCATCGCAA GAAAATCAGG TCTTTCGTCA TCGCCACCTG TTTTTGACTG GAGCGGCCCG
AGAGCTTTAA TTCGCCCCAA CATCGACGCA CGGGACGATT TCCGTGTCGA TCTGTCATCT
CGCCGCGAAG AATACGTGTC TGCCGATATT CGCGCTTTTT TTCATTCTAT ATACACTCAC
GCGATACCGT GGGCCATATA TGGAAAAGCT TGGTCAAAAA CTCATCGAGA TCCGTATCAC
TATGGAAACG CCATTGATCT TCTTTGTAGA AACGGACAGG ATGGACAGAC AATAGGTCTT
CCGGTTGGGC CAGATACCTC AAGACTTATC GCAGAAGTCA TAGTATCCGC TATAGATACA
GAGCTTCGCA CTCGGCTCAA CATTGGTGGC CGCGATGCAT CGCGATACAT CGACGACTAT
ACGATCAGCG GTGTAAATGG CGAGTCAGGC GAGGAAATAT TGGCGGCGCT ACGACAATCG
GCAGCGCTCT TTGAGCTTGA ATTAAACAAC GACAAGTCGG CTATATACCC GACATCCCAT
CGACAAAATA CCGGATGGAA ACAAGCTGTT AAGGCCCAAA TACCACCTCA GAACGCGGAT
GGCGCGGCCA TCCAACGCTT CTTCTATGAA GTTGGACAAA TCTGCGACGC TCACCCCGAA
GTAAATGTCG AAAAATACGC ACTACAGAAC GCTCGCTCGG CACTTATTCG AGCGAATGAA
TGGAAGAGCA TACAAAGCAA TCTCATTAGC GCCTATCGCC GGAACGCAAG CCTTGTTTCT
TTTATAGTTG AGGTAATTCT ACTCAGAAAG GCGGAGAATA ATGACATTGA CCTACGCAAT
CTAAAGGAAT TTATCGAACA TCGCATACCA GTCTTGGCGC GATCCAATAG AACCGGTGAG
ATAATCTGGT TCCTATTCAT GACCATAAGG CTTGGAATAT CTCTTTCAGC GAGAAAATTA
AGCCCCCTGC TATCCATAGA AAATTCAATG ATCGCCCTAC TGGTAGTGTG CGCCAGCTCG
AGAAACTTGG TGGAAGGCGA TCTAGATCTA CGGACTTGGA ACAGGGCGTT GAACGCCAAC
GGCCTTAGAG GTCCAATGTG GCTCTACGCC TATGAGGGCG TCACCCAAGG ATTCATCCCG
GGCGCGACTG ATAATTTCAT TGCAGCCGAT CCCTATTTTT CCCTATTGCA CGAGAAGCGA
GTGCAATTTC TGTCGATCGA ACAGGGCTTC ACTTCAATAG CGTCAACGCT ACGGAATCTT
CGAGGCGACA ATGACCGAAT GAGGCGCTTG AGAGAAGCTT TGGCGGATTT GGATCCCGAG
ATTGATGAGT TCGACGATGA GGACGACGGC GCTGGATTCG ACGACGACGG CTATTAA
 
Protein sequence
MNAKTKTERL GRLISHGYFA PELPPCFVSD RFARYRKSII SAIEALPQER GKPAFHRFIS 
EPSWFYFPRY GKDDRRHGVP NPIAHLLLSR TISDNYVDLR SIARKSGLSS SPPVFDWSGP
RALIRPNIDA RDDFRVDLSS RREEYVSADI RAFFHSIYTH AIPWAIYGKA WSKTHRDPYH
YGNAIDLLCR NGQDGQTIGL PVGPDTSRLI AEVIVSAIDT ELRTRLNIGG RDASRYIDDY
TISGVNGESG EEILAALRQS AALFELELNN DKSAIYPTSH RQNTGWKQAV KAQIPPQNAD
GAAIQRFFYE VGQICDAHPE VNVEKYALQN ARSALIRANE WKSIQSNLIS AYRRNASLVS
FIVEVILLRK AENNDIDLRN LKEFIEHRIP VLARSNRTGE IIWFLFMTIR LGISLSARKL
SPLLSIENSM IALLVVCASS RNLVEGDLDL RTWNRALNAN GLRGPMWLYA YEGVTQGFIP
GATDNFIAAD PYFSLLHEKR VQFLSIEQGF TSIASTLRNL RGDNDRMRRL REALADLDPE
IDEFDDEDDG AGFDDDGY