Gene Caci_4080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4080 
Symbol 
ID8335433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4607848 
End bp4609743 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content71% 
IMG OID644957183 
Producttype IV secretory pathway VirB4 protein-like protein 
Protein accessionYP_003114786 
Protein GI256393222 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.020005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0495521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCA CCAGCCGCGC TCGAATACGG CGCGAGAAGG AAGGCGCAAG CGCGCCGGTC 
CGTGCCCTGG ACGCGATGCC AGCGCCGGAG TCGGTGCAGG TCGCCCCGCG CTATCTGCGC
ATCGGGGAGA TGTACGCCGC CAGCTTCGCG GTCACCGGCT TCCCCTCCGA GGTCGGCCCC
GGCTGGCTCG AGCCGCTGCT GACCTACCCG GGACGGCTGG ACGTCTCGCT GCACATCGAG
CCGATCGAGA CCCAGGTCGC CGCCGAGCGG CTCCGTCGCC AGCGGGCGCG CTTTGAGTCC
GGCCGACGAT CGGACTTCGA GAAGGGACGC CTGGCCGACC CCTACGCCGA AGCCGCCGCC
GAGGACGCCA CCGAACTCGC GTATCGACTC GCCCGCGGCG AAGGTCGCCT GTTCCGTGTC
GGTCTGTACC TCACCGTGTA TGCCGGGTCC GAGAAGGCTC TCGCGGAGCA GACGTCCGCG
GTCAAGTCGC TTGCCTCGGG GCTGCTGCTG CAGATGCAGC CGACCTCGTT CCGGGCGCTG
GCTGGGTGGA CCAGCTGCCT GCCGCTGGGC GTGGACGCGA TCAAGCTGCG CCGCACCTTC
GACACCGCCA GCCTCGCCTC CGCGTTCCCG TTCACCAGCC CCGACCTGTC GCCGATCGAC
CCGACTGACA CCGCCGCCCC GGACGGCATC CTCTACGGCG TAAACGCCGC CTCGAGCGGG
CTGGTCATCT ACGACCGGTG GGCGCAGGAC AACTACAACT CGATCGTGCT GGCCCGCTCC
GGGGCCGGGA AGTCGTTCTT CTCCAAGCTG GAGATCCTGC GCAGCCTCTA CGGCGGGGTC
CAGGTCCTGA TCGTCGACCC GGAGAACGAG TACGAGCGCC TGGCCGCACT CGTCGGCGGC
GCGTACCTCG ACCTCGGCGC AGACGGGGTG CGGTTGAACC CGTTCGACCT TCCGGAGAAC
GCCGAGAAGT CGGCGGGCAA GTCCAACGCC CTGGTCCGCC GCGCGCTGTT CCTGCACACC
TTCATCACCG TCCTCGTCGG GGAGCCGCTC ACCCCGGCCG AGCGCGCCGC GCTGGACCGG
GCGGTCCTGG CCGCATACGC CGCCGTCGGC ATCACCGCCG ACCCGCGCAC CTGGAAGCGG
CAGGCGCCGA TCCTGTCGGA TCTGGCCGAG GCGCTGCACG CGAACTCAGA CCCGGCAGCG
GTCTCGCTCG CGGCGAAGCT GGAGCCGTTC ACCACCGGCT CGTGGGCCGG GCTGTTCGAC
GGCCCGACGA CCACGGCTCC GTCGGGGCAT CTTCAGGTGT TCTGCCTGCG GCACCTGCCC
GAGGAGCTGA AGTCGGTCGG GACGCTGCTG GTCCTGGACA CGGTGTGGCG GCAGGTCACC
TCGCGCGAGC GCCGCCGGCG CTTGGTCGTG GTCGACGAGG CGTGGCTGCT GATGCGCGAG
CCGGAGGGCG CGAAGTTTCT GCTGCGCATG GCCAAGGCCG CGCGCAAGTG GTGGGCCGGT
CTCGCCGTGA TCACGCAGGA CACCGCCGAT GTGCTGTCCA CGGATCTGGG CCGGGCGGTC
GTCGCCAACG CCAGCACGCA GATCCTGCTC CGTCAGGCGC CGCAGGCGCT AGACGCGGTC
GCCGACGCCT TCCACCTGTC CGCAGGCGAG CGGGATTTCC TGGCCGCCGC CCCGGTCGGA
ACCGGACTCC TGGCTGCCGG CGAGCAGCGG GTCGCCTTCG CCGCTGTCGC CTCCGAAACC
GAGTACGCGG TCGCGACGAC CAATCCGGCC GACCTCGTCG GTGAGGACGA GCCGGACGGC
TACTTCGACC CCGACAGCGA AGACCCCGAC GCCGAGGAAT CAGACCTCGC ATTCACCACT
GCCGACCTCG ACGACCCTGA CGGGATGCTC CTTTGA
 
Protein sequence
MTLTSRARIR REKEGASAPV RALDAMPAPE SVQVAPRYLR IGEMYAASFA VTGFPSEVGP 
GWLEPLLTYP GRLDVSLHIE PIETQVAAER LRRQRARFES GRRSDFEKGR LADPYAEAAA
EDATELAYRL ARGEGRLFRV GLYLTVYAGS EKALAEQTSA VKSLASGLLL QMQPTSFRAL
AGWTSCLPLG VDAIKLRRTF DTASLASAFP FTSPDLSPID PTDTAAPDGI LYGVNAASSG
LVIYDRWAQD NYNSIVLARS GAGKSFFSKL EILRSLYGGV QVLIVDPENE YERLAALVGG
AYLDLGADGV RLNPFDLPEN AEKSAGKSNA LVRRALFLHT FITVLVGEPL TPAERAALDR
AVLAAYAAVG ITADPRTWKR QAPILSDLAE ALHANSDPAA VSLAAKLEPF TTGSWAGLFD
GPTTTAPSGH LQVFCLRHLP EELKSVGTLL VLDTVWRQVT SRERRRRLVV VDEAWLLMRE
PEGAKFLLRM AKAARKWWAG LAVITQDTAD VLSTDLGRAV VANASTQILL RQAPQALDAV
ADAFHLSAGE RDFLAAAPVG TGLLAAGEQR VAFAAVASET EYAVATTNPA DLVGEDEPDG
YFDPDSEDPD AEESDLAFTT ADLDDPDGML L