Gene Caci_4031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4031 
Symbol 
ID8335384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4562127 
End bp4563455 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content70% 
IMG OID644957137 
Producttype II secretion system protein E 
Protein accessionYP_003114740 
Protein GI256393176 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.148527 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00766622 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGTCG CGACCTTCGA ACGTGAGGCG CTGACCGATG ACGAGCAGCA CCTGGTGCGC 
AAGCTGCGCG CCTCGGTCGG CGCCCGGCTG GCCGAGCGCG TCGGCGACAA CTCCGGGAGC
GTCGAGCGGG AGCGGGTCGG TCGGGAGCTG ATCGACGACG CGTTGATGGC CCACGCCCGC
GCCGGGCTGG CCGGCGGCGG CCGGGACGTG TTGGACGGCC AGGCCGAGGC CAAGGTGAGC
CGGGCGGTGT TTGACGCGCT GTTCGGCATG GGCGGTCTGC AGCGTTGGCT GGACGACCCG
TCGGTGGAGA ACATCACGGC CAATGGCTTC GACCATGTCT TCGTCCACTA CTCCGATGGC
AGCAAGGTCC CGGTTGGGCC GCTGGCCGCG TCCGAGGAGG ACTTCGTCGA CCTGCTGCGT
ACGATCGCGG CACGGGCCGG CACCGAGGAG CGGATCTTCG ACCGTGCGCA TCCGCAGCTG
AACATCCAGC TGCCCGACGG GTCCCGGCTG TTCGCGGTGA TGGCAGTGTC CCGCCGGGTG
TCGCTGACCG TGCGCAAGCA CCGCCACATG GCCACCTCGC TGCGGGAGCT CGCGGCCCTG
GGCATGTTCG ACCTCGACAT CGCCGCACAG CTGGAGGCCG CGGTACGGGC CAAGCTGAAC
ATCCTCATCT GCGGCGCCAT GGGCGGCGGC AAGACCACGG TGCTGCGCGG CCTGGCCGCG
TGCATCGGAC CGGAAGAGCG GCTGGTCACC ATCGAGGACA CCTACGAACT CGGGCTGGAG
CAGGCGCACC CCGACGTGGT GGCGATGCAG GCCCGCGAGG GGAACCTGGA GGGCCAGGGC
GCCGTGTCGC AAGCCGAGCT GGTGCGGATG TCGCTACGGA TGAACGCCTC CCGCGTCATC
GTCGGCGAGG TCCGCGGCGA GGAACTGGTG CCCATGCTCA ACGCCATGAC CATGGGAACC
GACGGCTCGC TGGGCACCAT CCACGCCTCG TCGTCCAAGC AGGCGTTCGA CAAGATGGCC
ACCTACGCCA TCCAGTCACC GGAGCGCCTC GACCGCGCGG CGACGAACCT GCTGGTCGGC
ACCGCCCTGC ACGTCGTGAT CCAGCTGGGT CGGCTGCGCG ACGGCACCCG CGTACTGTCC
TCGATCCGGG AGATCACCGG CGTCGGGGAC AACGGCGAGG TCACCAGCAA CGAGGTCTAC
AAGCCCGGCC GCGACGGCCA GGCCGTACCC GGAACCGGCT GGACCGCCGG CACCGCGCAG
CGCCTGATCG ACGCAGGGCT GGATGAGGAT GTGCTCACGC GTTCGGCGCG CGCGGGGTGG
TCGATATGA
 
Protein sequence
MSVATFEREA LTDDEQHLVR KLRASVGARL AERVGDNSGS VERERVGREL IDDALMAHAR 
AGLAGGGRDV LDGQAEAKVS RAVFDALFGM GGLQRWLDDP SVENITANGF DHVFVHYSDG
SKVPVGPLAA SEEDFVDLLR TIAARAGTEE RIFDRAHPQL NIQLPDGSRL FAVMAVSRRV
SLTVRKHRHM ATSLRELAAL GMFDLDIAAQ LEAAVRAKLN ILICGAMGGG KTTVLRGLAA
CIGPEERLVT IEDTYELGLE QAHPDVVAMQ AREGNLEGQG AVSQAELVRM SLRMNASRVI
VGEVRGEELV PMLNAMTMGT DGSLGTIHAS SSKQAFDKMA TYAIQSPERL DRAATNLLVG
TALHVVIQLG RLRDGTRVLS SIREITGVGD NGEVTSNEVY KPGRDGQAVP GTGWTAGTAQ
RLIDAGLDED VLTRSARAGW SI