Gene Caul_4552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4552 
Symbol 
ID5902013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4928331 
End bp4931624 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content69% 
IMG OID641565071 
Productautotransporter beta-domain-containing protein 
Protein accessionYP_001686170 
Protein GI167648507 
COG category 
COG ID 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.518565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCA AGGTCCTGGT CGCGACAGTC GCAACCGCTC CTCTCCTGGC CATGGCGTTC 
GGCGCTTATG CTGAGACCGT CGTCAACACC GCCCGTACGA CCCCGATCGC CACGGCGACG
GCGGCCGGCG CGGCCCCGGG GACGCCCGAC GACGTGAAGA TCGACGCCGC CGGCTCGATC
AAGCTGACCA CCGCGGGCCC GCTGGTCACG CTCAACAGCA ACAACAAGGT GACCAACGCC
GGCACCCTGG CGACGGTGGG CGTCGACGAC TCGACCGGCA TCCTGATCCT CGGCGGCAAC
ACCGGCTCGG TGACCAACTC GCTGGCGATC AGCCTGACCG AGGATTACAC CCCGACCGAC
GACGACAAGA CGGCGGCCGG CGTGGCGGCG CCAGACGGCG ACCTCGACGG TCCGTTCGCC
AAAGGAACCC ACCGTTACGG CATCCGCCTG ACGGGTCCCG GCGTCTTCAC GGGCAACATC
ACCCAGGACT CCGCCGGCTC GATCGTGATC GAGGGCAACA ATTCGGCCGG CATCTCCCTG
GAGACCGGCC TGGTCGGCAA CCTGACCACC AACGGCGCGA TCTCCGTCAC CGGCGACAAC
AGTTACGGCG TCCACCTGAC GGGGCCGGTC ACCGGCAAGG TCACCCAGGG CGGCTCGGTT
TCGGTCCTGG GCGACAAGGC CGTGGGGGTC GCGCTCGACG CCGACATCAG CGGGGCCTTC
GCCCTGCAGG GCTCGGTCGC CGCGACCGGC TACCGCTACA TCACGCGGCC GACCGAAACC
GCGGTCGGCA AGCTCGACAC CGACGACCTG CTGCAGGGCG GGCCGGGCGT GCGCGTCGCG
GGCGACGTCA CCGGCGGCGT GCTGCTGAAC GGGCCGACCA GCTCCACCGT CATCGACGGT
ACGACGACGA CCACCACGAC GGTGTCGTCC TCGACGACGG GGACCGGCTC GATCAGTGTC
TACGGCGGCG CCCCCGCCTT GCTGATCGGG TCCGACAGTC GCGCGATCAC GATCGGCGCC
GTCGGCGCGG CCGACAACGC CTATGCCCTG ATCGCCAAGG GCTCCATTGC GGCGACGGGC
GTGTACGACA AGATCGCCGC CATTGGCGTG CAGATCGGCG GCCAGACCGG CCAGACCACC
ACCTTGGCCG GCGGCGCGCG CCTGGACAGC TCCATCACGG TCAGTTCTCG CGAAGCCAAC
GCCACCGGCG TCAACTTCAC GGCGGGCGCC GTCGCGCCAT CGCTCTGGAA CCGTGGCTCG
ATCAGCGCCG TCAGCGCTTC CTATGCCGGC TCGCTCCTCA CCGGCGACGC ACGCGCCGTG
GTGATCAACG CCGGAGCCAA CGTCGTCAAT CTGAACAACA GCGGCGCCAT CGCGGCCACC
CGTAGCGGCG AGACCGGCGA CGCCGTCGGC ATTCTCGACA GCTCGGGCAC GCTGTCGACG
CTCATCAACA GCGGCTCGAT CACCGCGCAG GTCGTCGCGC CGACGGTCAA GACGGACGGC
ACGACGACCA CCACCACCGC CAAGGCGACA GGCAAGGCGA TCGCCGTCGA CCTAAGCGCT
TCCACGAGCG GCGTCACATT CACCCAGACG GCGCCCACCT CGACCGCCGC GACGAGCACC
TACCCCAGCG CCGACACCGT CACCACGGCC AGCGCCACCT CGACCACCTC GACCCCGACG
ATCATCGGCG AGGTTCGCTT CGGCTCGGGC GCGGACACGT TCAATATCCG GGCCGGCAAC
GTGTTCGGCG ACGTCTCGTT CGGGGCCGGC GCGGACGCCC TCAACATCAG CGGCGGCGGC
GTCCTGCTGG GCGCGCTGAG CGATTCGGAC GGTCTGCTGG CGATCGACAT CGGCAAGGGC
TCCCTGGGCC TGACCAACAC CGGGAACATC AACGCCACCA GCCTGAACAT CGGCGCTGAA
GGCAAGCTGG TGTTCACCGC CGATCCGACC GCCAACAACG GGGCCGGCGC CAACACCCAC
CTGGTGATCT CGGGCGCGGC CAACATCGCT ACCGGCGCCC AGATCGGCCT GCGGTTGACC
GGCCTGCTCA CCGCCCCGAC CACCTACACC GTGATCACCG CCGGCAGCCT GACGGCCGGC
ACGATCAACC AGGATCTGCT GGGCGGCACG CCTTATCTCT ACGTGGCCAG CAGCCGGACC
GACGCCAACA ACGTCTATCT CGACGTCACA CGCCGGACGG CGGCCCAAAT CGGCATGAAC
AAGGCCCAGT CCTCGGCCTA CGACGCAACG TTCGCGGCGC TGAGCAAGGA CAAGGACATC
GCCACCGCCT TCCTCGGCCA GAGCACCAAG GACGGCTTCC TTGGCCTCTA TGACCAGATG
ATGCCCGACC AGGGCGAAGG GTTCTTCGCG GCCCTGCAGA ACGCCAACCA GGCGATCTCG
TCGGCCACGG CCTATCGTCC CGATCCGGGC GACCGCTATG GACCCGACAG CCTGTGGATC
CAGGAGATCA ACACCCTGGT TCGCCGCGAC ACCGGCGACA CCATGGGCTC CGACACCCAG
GCCTTCGGCT TCGTCGGCGG CTATGAGGCC ATGGGCGACG CCGGCGGCGC GCTGGGCCTG
ACCCTGGCCT ATGTCAGCGT CGAGGAACAC GACATCGCCG CCAAGGTCGG CGAGCAGACC
ACCGGCAACT TCGTCCAGTT GGGCGGCTAT TGGCGCCGGG CGATCGGCGG CTGGCGGCTG
AACGCCGGCG GCGGCGGTGG CTTTGGCTGG TACGACGGCG ACCGGACGTT CAATTCCGGC
GACATCAACG GCGACGGAAC TGCTGACGTC CAGCGCCACA ACACCGCCAA GTGGAACGGC
TACACGTTCA ACGCCTTCGC CGGCACGGCT TACGAGGCCA AGTTCGGCCG CTACTTCGCC
CGCCCCGAAG GCCGCCTCGA CTACCTGTAC CTGAGCGAGG GCGAGCGTAA GGAAAAGGGC
GGCGGCGACG GCTTCGACCA TATCGTCCAC AAGCGCAAGT CCAGCAGCCT GACCGGCGAC
GTGGGCGTCG CCTTCGGCGC GGACTACGGC CGCGACCTCT GGTGGCGTCC CGAAGTGCGG
GTCGGCTACC GCCAGACCCT GGCCGGCGAC ATCGGCGACA CCACGTTCGG CTTCACCAAT
GGCGGCGCGC CCGTCACCCT GGCGGCCATG AACGACAAGA ACGGCGCGGT GACGTTGGGC
TTCGCCCTGC GGGCCGGCAC CCCGATGTCC TATCTGGCGC TTGAAGCCAA CGCCGAGGCC
GCCAAGAAGC AGAAGCGCTA TAACCTGAAG CTGACCGGAC GGGCGATGTT CTAG
 
Protein sequence
MQRKVLVATV ATAPLLAMAF GAYAETVVNT ARTTPIATAT AAGAAPGTPD DVKIDAAGSI 
KLTTAGPLVT LNSNNKVTNA GTLATVGVDD STGILILGGN TGSVTNSLAI SLTEDYTPTD
DDKTAAGVAA PDGDLDGPFA KGTHRYGIRL TGPGVFTGNI TQDSAGSIVI EGNNSAGISL
ETGLVGNLTT NGAISVTGDN SYGVHLTGPV TGKVTQGGSV SVLGDKAVGV ALDADISGAF
ALQGSVAATG YRYITRPTET AVGKLDTDDL LQGGPGVRVA GDVTGGVLLN GPTSSTVIDG
TTTTTTTVSS STTGTGSISV YGGAPALLIG SDSRAITIGA VGAADNAYAL IAKGSIAATG
VYDKIAAIGV QIGGQTGQTT TLAGGARLDS SITVSSREAN ATGVNFTAGA VAPSLWNRGS
ISAVSASYAG SLLTGDARAV VINAGANVVN LNNSGAIAAT RSGETGDAVG ILDSSGTLST
LINSGSITAQ VVAPTVKTDG TTTTTTAKAT GKAIAVDLSA STSGVTFTQT APTSTAATST
YPSADTVTTA SATSTTSTPT IIGEVRFGSG ADTFNIRAGN VFGDVSFGAG ADALNISGGG
VLLGALSDSD GLLAIDIGKG SLGLTNTGNI NATSLNIGAE GKLVFTADPT ANNGAGANTH
LVISGAANIA TGAQIGLRLT GLLTAPTTYT VITAGSLTAG TINQDLLGGT PYLYVASSRT
DANNVYLDVT RRTAAQIGMN KAQSSAYDAT FAALSKDKDI ATAFLGQSTK DGFLGLYDQM
MPDQGEGFFA ALQNANQAIS SATAYRPDPG DRYGPDSLWI QEINTLVRRD TGDTMGSDTQ
AFGFVGGYEA MGDAGGALGL TLAYVSVEEH DIAAKVGEQT TGNFVQLGGY WRRAIGGWRL
NAGGGGGFGW YDGDRTFNSG DINGDGTADV QRHNTAKWNG YTFNAFAGTA YEAKFGRYFA
RPEGRLDYLY LSEGERKEKG GGDGFDHIVH KRKSSSLTGD VGVAFGADYG RDLWWRPEVR
VGYRQTLAGD IGDTTFGFTN GGAPVTLAAM NDKNGAVTLG FALRAGTPMS YLALEANAEA
AKKQKRYNLK LTGRAMF