Gene Caul_4106 details

Gene Information

Locus tag: Caul_4106
Symbol:
ID: 5901568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4459880
End bp: 4462282
Gene Length: 2403 bp
Protein Length: 800 aa
Translation table: 11
GC content: 69%
IMG OID: 641564626
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_001685728
Protein GI: 167648065
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0810] Periplasmic protein TonB, links inner and outer membranes
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
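
The start, end, gene length and protein length values above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions, not something the record states. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(4459880, 4462282)
print(gene_len, protein_length_aa(gene_len))  # prints "2403 800"
```

Under those assumptions the computed values agree with the listed gene length (2403 bp) and protein length (800 aa).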


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0612134
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.370277
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAGAC ACCGTCACAC GATCGCCGGC GACAGCACAG CCGCCGGGCG TCGAGGCGCT 
TTCGCCGGCC GCAGGGCAGC CCTGGCCGCC TCCAGCTGTC TGGCGGGCGT AGCGGCCTGC
GCCCTGATCG GCCTGGCCGG CGCGATGACC CTGGGGACGG CGGCGCAGGC CCAGACCCTG
CCGACGGGCG GAACGGTCGC GGCGGGCGGC GCGACCATCA CCACCGGCCC CGGCGCGATG
ACCATCAACC AGTCGACCCC GAACGCCGCG ATCAACTGGC AGAGCTTCTC CATCGGCCAG
GGTGGCAGCG TCGTCTTCGT CCAGCCCGAC AGCCATTCGG TGGCGCTGAA CCGCGTGCTG
GGGCCGGACG CGTCAACGAT CTTGGGCGGG CTGACCTCCA ACGGCCAGGT GTTCCTGGTC
AATCCCAATG GCGTACTGTT CGGACAAGGG GCTCAGGTCA ATGTCGGCGG GCTGGTCGCA
TCCACTCTGG GAATGACTGA CGCTGACTTC ATGGCCGGAA ACTACCGGTT CTCGGGCAGC
GGAGGGATCG TCCGCAACCA AGGCGATATC ATCGCGACGG GCGGTTACGT CGCGCTGCTG
GGAGGCCAGG TCAGCAATGA CGGACTGATC CAGGCCAATC TGGGCACGAT CGCGCTGGCG
TCGGGCGAAG CCATCACCCT CGACGTCGCC GGCGACGGGC TGCTCAATGT CGTCATAGAG
AAGGGCGCAG CCAACGCCTT GATCCAGAAC AGCGGCATGC TTCAGGCCAA CGGCGGTCGG
GTGGTGATGA CCGCGCAGGG CGCGGGCGAC TTGCTCCGCA CGGTGGTGAA CAACACCGGC
GTCATCCAGG CGCGGACGAT CGGTCAGCGT AACGGAACCA TCCAACTGCT CGGCGACATG
CCAAGCGGGA CGCTGAACGT GGCCGGTGCC CTTGACGCCA GCGCGCCGGG CGGCGGGAAT
GGCGGGTCCA TCCAGACCTC CGCCGGGAGC GTGAACATCG CCTCCACGGC GGGGATCACC
GCGGCCGCCC CGACGGGCGT CGCGGGGATC TGGTTGATAG AGCCGGCCGA CTTCACGATC
GGCGCTGGCG GCAATATCTC CGGCGCGACC CTGTCGGCCC AGCTGGTGAC CACCAATGTC
ACGATCAACA CGCGGACGGC CGCCGGGCTG TCGGGTACAG GGGATATTCT CGTCAATGAC
GCGATCGTCT GGACGGCGTC GTCCACCCCC ACCACCCTGA CGCTGAACGC CAACCGCGAC
ATCAACATCA ACGCCGCGAT CACCGCCACA AAGGGCAATT TCGTCGCTTG CTGCGGGCGC
GATGTGGCCG TCAACGCCCC CATCACCACG GTGAACGGCA GCGTGCTGCT GAACGCCGGC
CAGAACGTCA CCGTGTTTCA CGCGATCACC ACCACGGACG GCAACATCGC CCTGTGCGCC
GGGCATGACG TCCATATCGA CGCGGCCGTC ACCCTGACCC GCGGCAGCAC CATTCCCGCC
CAGAGCCTGG GCCTGCCCGT CGGCCTGACC CTCATCGCCG GCGCGGGCGG GACGGGTCCG
GGCGTGGGCG GCGGCACGAT CATCTTCAGC CCGTTGGCGC CCCGCGTCAC GGTCACGGCC
ACCCCGGTCA CGATCAATTA CAACCCGGTT TCCTACGCGG CGCCGGCGGA CTTCTCGACC
CGGTTCACCC TGACCGAGGG CGCCGCCCTG ACACAGCGGA TGCTGCTGTT CCCGGACGGA
AGCCGGGTGT TCGACGGCGG GACGGCCACG ACCCTCTCCG GCTTCAGGAC CACGGCGACC
TCGGGGTTGC CCACGGGCGT CACCTTGGTG GCGGGCCCCG GCGCGACCGC GACCTTCGAT
TCGGCCGCCC CGGGCGCCGA CGTCGGGATC ACCTACAGCG GCTACACCCT GGCTGGGGCG
AACGCCGACC AATACGCCTT GGCGGGCTTC TGTTGTGTAT CGACTCAGAG AACGCAAGGC
ACGATCTCGG CGGCGGTGGT CACGCCGCCA GTGACCCCGC CAGTGACCCC GCCAGTGACC
CCGCCGGTTA CTCCGCCAGT GGTCCCGCCG GTGGTCCCCC CAGTCACCCC GCCCGTGACG
CCGCCCATAA CGCCGCCGGT GGTCCCCCCG GTGACGCCGC CGGTGACGCC GCCCGTGACG
CCGTCTCCGG CCTCGCCGGG ACCGACCGCC TTCTACCCGA TCATCACGCC AACCCCGGCC
TTGGTCGCCT CGCCCGATCT GGCCTTCAAC GTGGTGGGGG GAGGCGTGCG GATGCCGCCT
TACGAATCGG CCCGCATCTC TCCGCCGGTG GAGGAGGTCG TTCGGACGGT GGAGAAGACC
GCGCCGGTCG CGCCGCGTCC TGTGCAGGTC CCCGTCTATC CCCGCAAGCA GGATCGCAAC
TGA
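
The GC content reported for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the fraction of G and C bases over the full sequence once the spaces and line breaks of the 10-base layout are removed; `gc_fraction` is a hypothetical helper, and only the first line of the sequence is used here for illustration.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring any whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence, copied with its 10-base spacing.
first_line = "ATGACCAGAC ACCGTCACAC GATCGCCGGC GACAGCACAG CCGCCGGGCG TCGAGGCGCT"
print(f"{gc_fraction(first_line):.2f}")
```

Passing the complete gene sequence as a single string would give the value to compare with the GC content field of the record.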
 
Protein sequence
MTRHRHTIAG DSTAAGRRGA FAGRRAALAA SSCLAGVAAC ALIGLAGAMT LGTAAQAQTL 
PTGGTVAAGG ATITTGPGAM TINQSTPNAA INWQSFSIGQ GGSVVFVQPD SHSVALNRVL
GPDASTILGG LTSNGQVFLV NPNGVLFGQG AQVNVGGLVA STLGMTDADF MAGNYRFSGS
GGIVRNQGDI IATGGYVALL GGQVSNDGLI QANLGTIALA SGEAITLDVA GDGLLNVVIE
KGAANALIQN SGMLQANGGR VVMTAQGAGD LLRTVVNNTG VIQARTIGQR NGTIQLLGDM
PSGTLNVAGA LDASAPGGGN GGSIQTSAGS VNIASTAGIT AAAPTGVAGI WLIEPADFTI
GAGGNISGAT LSAQLVTTNV TINTRTAAGL SGTGDILVND AIVWTASSTP TTLTLNANRD
ININAAITAT KGNFVACCGR DVAVNAPITT VNGSVLLNAG QNVTVFHAIT TTDGNIALCA
GHDVHIDAAV TLTRGSTIPA QSLGLPVGLT LIAGAGGTGP GVGGGTIIFS PLAPRVTVTA
TPVTINYNPV SYAAPADFST RFTLTEGAAL TQRMLLFPDG SRVFDGGTAT TLSGFRTTAT
SGLPTGVTLV AGPGATATFD SAAPGADVGI TYSGYTLAGA NADQYALAGF CCVSTQRTQG
TISAAVVTPP VTPPVTPPVT PPVTPPVVPP VVPPVTPPVT PPITPPVVPP VTPPVTPPVT
PSPASPGPTA FYPIITPTPA LVASPDLAFN VVGGGVRMPP YESARISPPV EEVVRTVEKT
APVAPRPVQV PVYPRKQDRN