Gene Caul_4107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4107 
Symbol 
ID5901569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4462335 
End bp4463999 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content69% 
IMG OID641564627 
Productpolypeptide-transport-associated domain-containing protein 
Protein accessionYP_001685729 
Protein GI167648066 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.416704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.195824 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGAT CGAGTCTTAT CATGGCGGCG CTGTTGGCGG CGGGCCAGGG CGCCCAGGCT 
CAACAGCCCC CGGCTACCGG GCAGTTGCAG CAAATCCCGC CCGCCGCCGT TCCCCAAAGG
CGAGCCCCGG ACATCCGCAT CGAACGGCCA GGGCCGTCGA CGGACGCCGT TCCCGAAGGC
GCCCGCATCC GGGTGGACAC GCTGCGCGTC ACGGGCGCGA CCCTGTTCAC GGAAGCGGAG
CTGGTGGCGG CGACGGGCTT TACCCCCGGA CGCGACCTTA CCCTTCCTGA CCTTCGCAAT
GCGGCCGCGC GGATCACGCG CTTCTATAAT GATCGTGGCT ATGTGCTGGC CCAGGCCTAT
CTCCCGGCGC AGGACGTCGT GGCGGGGACC GTGACCATCG CCATGGTCGA GGGCCGTTAC
GGCGCGGTCG CCCTGCGCAA CCAATCTGGC GTTTCCGACG CCGTGGCCGG GGGTGTGCTG
AACGGCCTGA ACCGCGGCGA TCCGGTCGCG ATAGCGCCGC TCGACCGGCG CTTGCTGCTG
CTGTCCGACA TCCCCGGCGT CGTCGTCCAT TCGACCCTGT CGCCGGGCGC CGAGGTGGGG
TCTTCGGATC TGACCGTCGA CCTGACCCGC GCGCCCCGGA TCTACGGCAG CCTCGAGGCC
GACAACGCCG GCAATCGCTA CACCGGCGCC TATCGGTTCG GCGGCTCGGT CAATCTCGCC
AATCCGACGG GCAGGGGCGA CCTGATCAGC CTGCGCCTGC TCGCCTCGAC CGAAGGCCTG
GCCTACGGAC GCGCCGCCTG GCAGGCGCCG CTGGGCGACG CCACCGTCGG CGTCGCCTAC
ACCCACATGC AGTATGACCT GGGCCACGAG TTCTCGGCCC TGGACGCCAG CGGCGTGAGC
GACATCGCCA GCCTGTTCGC CAGCTACCCG CTGATCCGCT CGCGAACCGC GAATCTCTAT
GCCCTGGGCA GTATCGACGC GAAATTCCTC AGCGACGAGA TTGGCCTTGT CTCCCAAGTG
TCGGACAAGA CCGTCCGGGC CGTGACGGTC GGCCTGCGCG GGGACTCGCG CGACGACTTC
GGCGGGGGCG GCTGGAACAC GGCCTCCCTA TCCTGGACCT CGGGCGAGCT GGACATCGAA
AGTCCGCTCG AACGGGCCGC CGACGCGGCG GGCGCCCGCT CCCAGGGCGG GTTCAACAAG
CTGCAGTACG CCGTCTCCCG GCTCCAGACG GTGCGTGGAC CGCTGTCGGT CTATGGCGCG
TTGCGCGGAC AGATCGCCAC CGACAATCTC GACAGTTCCG AAAAGATGGA GCTGGGCGGC
GCCTATGGGG TCCGCGCCTA TCCGGAAGGC GAGGCCTACG GCGACCAGGG CTATGTCGCG
ACGGTGGAAG CGCGATGGAT GCTCGACGCG TGGACGCGAC CTCTTCCGGG CCAATTCCAG
CTTGTCGCCT TCGTGGACGC CGGCGCGGTC GACTACGCCA AGGATCCCTG GTTCTCCGGC
CCCAATCATG CCCGACGCAG CGGCGGCGGC CTCGGCGTCA ACTGGTTCGG CCCGGACGAC
CTCAGCGTCC GCGCCGCCTA CGCCCGCCGC TTCAACGACC AGATTTCGAC CTCGGGGCCT
GACCGAAAGG GCCGCGTCTG GTTCCAGATC GTCAAGCTGT TCTGA
 
Protein sequence
MLRSSLIMAA LLAAGQGAQA QQPPATGQLQ QIPPAAVPQR RAPDIRIERP GPSTDAVPEG 
ARIRVDTLRV TGATLFTEAE LVAATGFTPG RDLTLPDLRN AAARITRFYN DRGYVLAQAY
LPAQDVVAGT VTIAMVEGRY GAVALRNQSG VSDAVAGGVL NGLNRGDPVA IAPLDRRLLL
LSDIPGVVVH STLSPGAEVG SSDLTVDLTR APRIYGSLEA DNAGNRYTGA YRFGGSVNLA
NPTGRGDLIS LRLLASTEGL AYGRAAWQAP LGDATVGVAY THMQYDLGHE FSALDASGVS
DIASLFASYP LIRSRTANLY ALGSIDAKFL SDEIGLVSQV SDKTVRAVTV GLRGDSRDDF
GGGGWNTASL SWTSGELDIE SPLERAADAA GARSQGGFNK LQYAVSRLQT VRGPLSVYGA
LRGQIATDNL DSSEKMELGG AYGVRAYPEG EAYGDQGYVA TVEARWMLDA WTRPLPGQFQ
LVAFVDAGAV DYAKDPWFSG PNHARRSGGG LGVNWFGPDD LSVRAAYARR FNDQISTSGP
DRKGRVWFQI VKLF