Gene Caul_2104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2104 
Symbol 
ID5899559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2250770 
End bp2252386 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content67% 
IMG OID641562593 
Productpolypeptide-transport-associated domain-containing protein 
Protein accessionYP_001683730 
Protein GI167646067 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.770543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTCGGC CGATCATGGA CGACGACCGA TCCGGCGAGC TTCCTCCGAG GCGCGCCTGG 
GCGTCCGTGG CGGCGATCCT GGCGGTTTGC GCCTCTGGAC AGGCGGCCCT TGCCCAGACA
GCAGCCGGTC AGGAGCCGAC CTTCGACGTT CAGGCCATCG ACGTCGATGG CAACACGGTC
CTGGACCAGG CCTCGCTTGA GGCCGCGATC TATCCCTTCA TGGGTCCGGC TCGAACCCGC
GCCGACGTGG CGAACGCCCA GAAGGCCCTC GAAGACGCCT ATCACGCGCG CGGATACCAG
ACCGTCGTCG TCGAGGTGCC GCGCCAGAAC GTCTCCACCG GCGTTCTCAA ACTGCACGTG
GTTGAGGCGC CGGTCGGCCG CCTGCGCGTC GTTGGATCGA AGTATCACTC GCTGGACCGG
GTCAGGGAAG ATGTCCCGGC CTTGGCCGAG GGGCGGGTCG CCAACCTCAA CGACGCCCAG
GCCGAGATCA ACGCCGCCAA CCGCCTGCCG GACCGTCAGA TCACGCCGAT CATTCGCGCC
GGCCTCGCGC CGGGCACGGT CGACATCGAC CTGCAGGTGG CCGAGGAGCC GCCGCTGCAT
GGCAGCCTGG AACTGAACAA CGACCGCAGC GCCTTCACCA AGCCTCTCAG GCTGAGCGGC
AATCTTCGCT ACGACAATCT GTGGCAGCGC GGCCACAGTG TTTCGCTGAC CTATCAGGTG
GCGCCGCAGC GACCGGACGA CACCGAGATC TATGCGGGGT CCTATGTCGC ACCGATCTGG
GGCACGCCAT TCACGGCGCT GCTCTATGGT TTCCATTCGA ATAGCAACGT CGCCACCCTG
GGCGAGGTCG CCGTGCTGGG CAAGGGCACG ACCGTTGGTA CACGGTTGAT CTACCAGTTC
CCGCCGAGCG GGGCGGTCAG CCAGAGCTTC TCGGCCGGGC TGGACTACAA GAAGTTCTTC
GAGCTCGTGA CCCTGAACGC CGCCGGCGTG AGCTCTGGCC AAGTCGAGTA CTGGCCCCTG
ACCGCCAGCT ACACCTGGCG TCATCAGGCT CGCGGCAGCA CGACGGCGAT CGTCTCGCTC
ACGGCCAATA TCCGCGGCTT CGGCGACGAC GACGAGGGCT TCCAGTCGCG GCGGTTCGAT
GCCCGGGCCA ACTTCGTCCA CCTGAACCTT GAGGCCGAGC ACACGAGACC CGTGGGCCAC
GGTTTCGACG CGACGTTCCG CTTCACCGGC CAGTTGGCCG ACATGCCGCT GCCGTCGGCC
GAGCAGTTCG CGGCGGGCGG GCTTTCCAGC GTTCGCGGCT ATCTGTCGGC CGAGGTGGTC
GGCGATGGCG GGATCTTCGG ATCAATCGAG CTGGCCAGCC CGACCCTGCC CGGCGTGGCC
GGACGGATCG ATAATTGGCG CGTCTACGGC TTCCTCGACG GCGCCGGCGC CTGGGTGCTG
CAGCCCCAGC CGGAACAGCG GGAGGAGTTC TTCCTCTACA GCGCCGGTCT CGGCACGCGG
GTGCAACTGC TCAAGCACCT CAACGGCGAG GTCGTCGCCG CCTTCCCCCT GCGAAACGGG
CCCAACCGCA CCGACGGTCG CCCCTATTTC ACCTTCAGCC TGAAAGCGGA GTTCTAG
 
Protein sequence
MARPIMDDDR SGELPPRRAW ASVAAILAVC ASGQAALAQT AAGQEPTFDV QAIDVDGNTV 
LDQASLEAAI YPFMGPARTR ADVANAQKAL EDAYHARGYQ TVVVEVPRQN VSTGVLKLHV
VEAPVGRLRV VGSKYHSLDR VREDVPALAE GRVANLNDAQ AEINAANRLP DRQITPIIRA
GLAPGTVDID LQVAEEPPLH GSLELNNDRS AFTKPLRLSG NLRYDNLWQR GHSVSLTYQV
APQRPDDTEI YAGSYVAPIW GTPFTALLYG FHSNSNVATL GEVAVLGKGT TVGTRLIYQF
PPSGAVSQSF SAGLDYKKFF ELVTLNAAGV SSGQVEYWPL TASYTWRHQA RGSTTAIVSL
TANIRGFGDD DEGFQSRRFD ARANFVHLNL EAEHTRPVGH GFDATFRFTG QLADMPLPSA
EQFAAGGLSS VRGYLSAEVV GDGGIFGSIE LASPTLPGVA GRIDNWRVYG FLDGAGAWVL
QPQPEQREEF FLYSAGLGTR VQLLKHLNGE VVAAFPLRNG PNRTDGRPYF TFSLKAEF