Gene Caul_2148 details


Gene Information

Locus tag: Caul_2148
Symbol:
ID: 5899603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2327114
End bp: 2328607
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 69%
IMG OID: 641562638
Product: TolC family type I secretion outer membrane protein
Protein accession: YP_001683774
Protein GI: 167646111
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01844] type I secretion outer membrane protein, TolC family
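
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2327114, 2328607)
print(length)                     # 1494
print(protein_length_aa(length))  # 497
```

Both values match the Gene Length and Protein Length fields of the record.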


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.400858
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0489946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAATA ACCGCCGAGC CGGTTTGCTG GCCGCCGCTT GCAGCATGGG CCTGATGGCC 
GGTCTGGTTT CGGCCGCGAG CGCCGAAACT CTGAACGACG CCCTGGCCCT GGCTTACCAG
ACCAATCCCA CCCTTCAGGC TCAGCGCGCC AACCAGCGCG TGACCGACGA GGGCGTGGTC
CAGGCCAAGT CGGCCTTCCG ACCCAATCTG AGCGGTTCGG CCGACGTCAC CGGTCAGCGG
ACGGACTACG CCAAGCCCAG GCAATCGCTC GTCGGCAACA CGATCGTCAA CAAAGACCAA
CAGACCGCCT ACGGCAGCGG GGGCAGCTTT TCCCTGTCGC AGCCGCTCTA TACCGGCGGC
CGCGCGAGCT CCAACCTGAC CGCCGCCGAG GCCGACGTGA TGGCCGGCCG CGAGGATCTG
CGCAGCGTCG AGCAGTCGGT GCTGGGCAAT GTCATCCAGG CCTATGTCGA TGTCCGCCGC
GACCAGGAGC GCCTGCGCAT CGCCCAGGAG AACGTCAGCG TCCTGAACCG TCAGCTCGAA
GAGGCCCGCG CCCGTTTCGA GGTCGGCGAG ATCACCCGCA CCGACGTGGC CCAGTCCGAA
GCCCGTCTGG CCGGCGGCCA GGCCAGCCAG TCGTCGGCCC AGGCGATCCT GGCCGGCAGC
CGCGCGGCCT ACGCCGCCGT GGTCGGCCAG AACCCTACCA ACCTGGCGCC GGAACCGTCG
CTGGCCGCCC TGCTGCCCGC CAGCGTCGAG CAGGCCTTCG ACTTGGTCGA CCAGAGCAAT
CCGCAGATCC AGGCCGCCCG CTACGCCGAA CGCGCCGCCG CGGCGCGCGT GGCCCTGGCC
AAGGCCGCCA TGCGTCCCAC GGTTTCGGCG CGCGCCGGCC TGGGCTGGGA GTCCGAGGGC
CGGGTGGACG GCAAGGGCAA CCAGTTCGGC GACTATGATC GCGGGATCAA CGGTTCGATC
ACCGCCTCGG TGCCGATCTT CACCGGCGGT CTGACCAGCT CGCAGATCCG CGCCGCCAAG
GAGCGCGAGA ACGCCGCTCA CGTCGCCGTC GAGGGCGCCA AGCGCACCGC CCTGCAACAG
ATCTCGACCG CCTGGAACAA CCTGCTGGCC GCCCGCGCCA ACCTCGTCTC CAACGAGGAG
CAGGTTCGCG CCGCCCGGAT CGCCTTCGAA GGCGTGCGCC AGGAACAGCA GGTCGGTCTG
CGCACCACCC TGGACGTGCT CAACGCCCAG CTGGAGCTGT CCAACGCCGA GGTGGCCCTG
GTCATCGCCC GCCACGACGA ATATGTCGCC AGCGCCAGCG TCCTGCAGGC CATGGGCGTG
CTGAACGTCG CCAACCTGGC GCCGGACGTC GAACGCTACG ATCCGGTGAA GTCCTACAAC
AGGGTCAACC ACGCCATCGG CTGGGTGCCG TGGGAGCCGG TGGTGCAGGT GATCGACAAG
ATCGGCGCGC CCTCGACGGC GGTCAGCAAC CCCACGCCCG TCGCCGCCAA GTAG
 
Protein sequence
MSNNRRAGLL AAACSMGLMA GLVSAASAET LNDALALAYQ TNPTLQAQRA NQRVTDEGVV 
QAKSAFRPNL SGSADVTGQR TDYAKPRQSL VGNTIVNKDQ QTAYGSGGSF SLSQPLYTGG
RASSNLTAAE ADVMAGREDL RSVEQSVLGN VIQAYVDVRR DQERLRIAQE NVSVLNRQLE
EARARFEVGE ITRTDVAQSE ARLAGGQASQ SSAQAILAGS RAAYAAVVGQ NPTNLAPEPS
LAALLPASVE QAFDLVDQSN PQIQAARYAE RAAAARVALA KAAMRPTVSA RAGLGWESEG
RVDGKGNQFG DYDRGINGSI TASVPIFTGG LTSSQIRAAK ERENAAHVAV EGAKRTALQQ
ISTAWNNLLA ARANLVSNEE QVRAARIAFE GVRQEQQVGL RTTLDVLNAQ LELSNAEVAL
VIARHDEYVA SASVLQAMGV LNVANLAPDV ERYDPVKSYN RVNHAIGWVP WEPVVQVIDK
IGAPSTAVSN PTPVAAK
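
The relationship between the gene sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch, not the database's own pipeline: the names `CODON_TABLE`, `clean`, `gc_percent`, and `translate` are hypothetical, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may serve as starts). The sequence is assumed to contain only A, C, G, and T once the spacing is removed.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-base spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, copied from the record.
first_block = (
    "ATGTCGAATA ACCGCCGAGC CGGTTTGCTG GCCGCCGCTT GCAGCATGGG CCTGATGGCC"
)
print(translate(first_block))  # MSNNRRAGLLAAACSMGLMA
```

The output corresponds to the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.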