Gene Caul_0479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0479 
Symbol 
ID5897934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp520198 
End bp521793 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content67% 
IMG OID641560962 
ProductTRAG family protein 
Protein accessionYP_001682111 
Protein GI167644448 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.903949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCGA CACGCATCCT TTGGGCGCCG GTGATCGTCG TCTTCGTCCT GGTCCTGGCG 
ACGACCTGGG GCGCGACCCA GTGGACGGCC AACGCCCTGG GCTACCAGCC CGAACTTGGC
GCGCCCTGGC TGATCATCGG CGACCATCGC CTCTATCCGC CGCCCGCCTT CTTCTGGTGG
TGGTTCGCCT ATGACGCCTA TGCGCCGCGC ATCTTCCTGC AGGGCGCGGC CATCGCCGTC
TCGGGCGGCC TGCTGTCCCT CGTCGCGGTC ATCGGCATGG CGGTATGGCG GGCTCGGGAG
AGCGGCAAGT CCAGCGCGTT CGGCACGGCG CGGTGGGCGA TCCCACGTGA GGTCCGCGCC
GCCGGACTGC TCGGTCCCGA CGGCGTGATC CTGGGGCGCC TCGACAAGGC CTATCTGCGC
CATGATGGCC CCGAGCATGT ATTGTGCTTT GCCCCGACGC GCTCGGGCAA GGGTGTGGGC
TTGGTCGTTC CCAGCCTGCT GACCTGGCCC GGCTCGGCGA TCGTCCATGA CATCAAGGGC
GAGAACTGGA CCCTGACGGC CGGCTTTCGC GCGACGTTCG GCAAGGTCCT GCTGTTTGAT
CCGACCAACC CCGGGTCCTC GGCCTACAAT CCCCTGCTGG AGATCCGGCG CGGCGTCTTC
GAGGTGCGCG ACGTTCAGAA CGTGGCCGAC ATCCTGGTCG ATCCGGAAGG TTCGCTCGAC
AAGCGCAGCC ACTGGGAAAA GACCAGTCAT TCTCTGCTGG TCGGGACGAT CCTCCACGTC
CTCTATGCCG AACCGGACAA GACCTTGGCC GGGGTCGCGG CCTTCCTGTC CGATCCGCAA
CGGACGATCG AACAGACGCT CGATGCGATG ATGCGCACGC CGCATCTGGG GGCGGACGGG
CCGCACCCCG TCGTCGCCAG CGCGGCGCGC GAACTCAAGA ACAAGAGCGA CAACGAACGC
TCCGGGGTCT TGAGCACGGC GATGACCTTC CTGGGTCTCT ATCGCGACCC GACGGTGGCC
CAGGTGACAC GGCGCTGCGA CTGGCGTATC GCCGATCTCG TCGATGGCGG ACCCTGCACC
CTCTACCTGG TGGTTCCGCC CTCGGACATC AGCCGGACCA AGCCGCTGGT GCGACTGCTG
CTCAACCAGA TCGGCCGTCG CCTGACCGAA CAGTTGGCTG ACACCGCTGG CCGCCAGCGG
GTGCTGCTGA TGCTCGACGA GTTTCCGGCC CTGGGGCGCC TGGACTTCTT CGAGAGCGCC
CTAGCCTTCA TGGCCGGCTA TGGCCTCAAA GCATTTTTGA TCGCCCAATC GCTTAGGAGC
GCGCTGGATG TTCTCCGCCA GGCTGTCCTC CTCGGCCAGA CCCTCGGTGC GGACCACGCA
GGGGATCAGG GCGTTGCGGG CCAGACGCTT CTGCTTGACC AGCAGTTCCA GAGCTCGATA
GCGCCGGCCG CCGGCCGGGA TCTCGAACAT GCCGGTCTCG GCGCCGCCGG CGTCCAGCAC
GGGCCTAACG CTCAGCCCGT GCAACAGCGT GCGCCGGGCA ATGTCGTCGG CCAGTTCGTC
GATCGACACG CCGGCCTTGG TGCGGCGGAC ATTTGA
 
Protein sequence
MSATRILWAP VIVVFVLVLA TTWGATQWTA NALGYQPELG APWLIIGDHR LYPPPAFFWW 
WFAYDAYAPR IFLQGAAIAV SGGLLSLVAV IGMAVWRARE SGKSSAFGTA RWAIPREVRA
AGLLGPDGVI LGRLDKAYLR HDGPEHVLCF APTRSGKGVG LVVPSLLTWP GSAIVHDIKG
ENWTLTAGFR ATFGKVLLFD PTNPGSSAYN PLLEIRRGVF EVRDVQNVAD ILVDPEGSLD
KRSHWEKTSH SLLVGTILHV LYAEPDKTLA GVAAFLSDPQ RTIEQTLDAM MRTPHLGADG
PHPVVASAAR ELKNKSDNER SGVLSTAMTF LGLYRDPTVA QVTRRCDWRI ADLVDGGPCT
LYLVVPPSDI SRTKPLVRLL LNQIGRRLTE QLADTAGRQR VLLMLDEFPA LGRLDFFESA
LAFMAGYGLK AFLIAQSLRS ALDVLRQAVL LGQTLGADHA GDQGVAGQTL LLDQQFQSSI
APAAGRDLEH AGLGAAGVQH GPNAQPVQQR APGNVVGQFV DRHAGLGAAD I