Gene Caci_4390 details

Gene Information

Locus tag: Caci_4390
Symbol:
ID: 8335744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4982500
End bp: 4984458
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 69%
IMG OID: 644957493
Product: Rhs element Vgr protein
Protein accession: YP_003115095
Protein GI: 256393531
COG category: [S] Function unknown
COG ID: [COG3501] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01646] Rhs element Vgr protein
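
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are inclusive, and that the CDS ends in a single stop codon that contributes no amino acid. The variable names are made up for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 4982500
end_bp = 4984458
reported_gene_length = 1959      # bp
reported_protein_length = 652    # aa

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is made of whole codons and the last one is a stop
# codon, which is not counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1959 0 652
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.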


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.86257
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.698434
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTCG GCTCCTTCTC CTCGGTCCCC AAGGTCGAGA TCGGCGGCAC GCTGCCCCGG 
CTCCTGAAGG CCTCCCTGGA CTCCTGCTGG GTGGAGTCCA CCCTGAACGT GCCCTCCACC
TTCCACATCG CGTTCAAGGA CAAGACCCGC CTGCTGATGT CCACCAACGC GCAGCTGAAG
ATCGGCGCGC CGGTGACGAT CTTCGCCGTC GCCGGGCTCA TCGGCGAGGA CCAGCCGCTG
ATCACCGGGC AGGTCACCGG GATCGAGGCG GACTACTCCG GCGGCGACTT CTACACCGTC
ATCCGCGGCA TGGACCACGC CTTCAAGCTG CTGCGCAAGC GGCGGGTGGC GCTGTACAAG
AACATGTCCG CCTCCGACAT CGTGCGCCAG GTCGCCGGGC AGCACGGGGT GGCGATCGGC
AAGATCGAGT CCACGCCCCC GCCGCCTCCG GACTCCCAGA CCTCCCAGCC CAACGTCGAC
GACTGGACCT TCCTGCAGAG TCTGGCCGAG CGCGCCGGCA AGGTCGTGTA CTTCGACAAC
AAGGGCCTGC TGCACTTCCG CGCGCCGGTC AAGGCCGTGC CGCTGACCGG GCTCAGCGCC
GACAAGAGCC CGTACGTGCT CGAATTCGGT GCCAACACCC TGCGCTGCCG CTCCGGCTTC
ACCGCCGCCG ACCAGGTCTC CATGGTCAGC TCGCGGGGCT GGAACATGGT CACCAAGCAG
ACCCTGATCG GCCGGGCGCA GGCCGCGGCG AACCCCGACG TGCTGGCCGG ACTGAGCCCG
GCGCAGGTCT CCCGGCCCTT CGGCACCGGC ACGCTGGTGG AGACCGGCAC CCCGTATGTC
AGCCAGAGCG AGACCGACTC GGCGGCCAAG TCCCTGGCCA CCGACGTGAC CTCGTCCTTC
GCCGAGCTCG AGGTCGCCGT CCGGGGCACC CCGCAGCTGC TGCCCGACAA GTCGGTCACC
CTGACCAAGG CCGGCACGCC CTTCGACGGC GCCTACACCG TCACCGGCGT CCGCCACCTG
TTCGAGCACG GCACGTACGA GACCTGGGTG TCCATGACCG GCCGGCAGTT CCGCTCGCTC
TACGGCCTGG CCTCCGGCGG CGCGCACGGT CCCGGCGGCA CCGGGCAGCG GATGAGCGGC
GTGGTCAGCG GGATCGTCAC CGACATCCAC GACCCGCTGC GCATGGGCCG GGTCAAGCTG
CGCTTCCCCT GGCTCGACGA CGACTACGTC AGCGACTGGG CCCGCACCGT CCAGCACGGC
GGCGTCAGCG TGCACGGCGG TCTGGAGCCG GCCCACGGCG CCGACCACAT CCCCGCCGGC
TCCCCGGGCT CCGGCGGCTT CATCGGCTAC GCCGTCAACG ACGAGGTGCT GGTCACCTTC
GACCGCGGCG ACTTCGACCA GCCCTACGTC ATCGGCGGGC TCTACAACGG CGTGAACAAG
CCGACCCGCT TCACCGAGGA CAACCTGGTC TCCAAGGACG GCATCCCCAA CGTGCTCGCC
GTGTCCTCCC GGCGCGGCAA CCGGCTGGAG CTTCTGGACG ACGAGCTCGG CATGAAGGCC
GGGGTCAAGG TCCTGACGAG CGACGAGAAA CAGAGCATCG AGCTGGACAA GATGACCAAG
ACCAGCACCG TCAAGAACTC CGTCGGCCCG ATCATGGTCG AGAGCAACGC CCCTGACGGC
CGGGTGACGA TCCGCAGCGG CGCGGCCAGC ATCACCCTGA CCGCCGAGGG CACCGTGAAC
ATCGAGGGCG CCACCGAGGT CAGCGTCTCG GCCGGCGCGA TGCTCTCGCT CAAGGCCGCC
GAGCTGAACC TGGCCGCGCC GGTCACCACG GTGGAGTCCG CGGAGATCAA CTTCGCCGGC
GCGTCGTTCT CCGTCGAGGC GGCCGAGATC ACGCTGACCG GCAACGTGGC CATCGTCGGC
GAGGGCACGA TCGACGCCCA GCAGATCGTG GTGATCTGA
 
Protein sequence
MTFGSFSSVP KVEIGGTLPR LLKASLDSCW VESTLNVPST FHIAFKDKTR LLMSTNAQLK 
IGAPVTIFAV AGLIGEDQPL ITGQVTGIEA DYSGGDFYTV IRGMDHAFKL LRKRRVALYK
NMSASDIVRQ VAGQHGVAIG KIESTPPPPP DSQTSQPNVD DWTFLQSLAE RAGKVVYFDN
KGLLHFRAPV KAVPLTGLSA DKSPYVLEFG ANTLRCRSGF TAADQVSMVS SRGWNMVTKQ
TLIGRAQAAA NPDVLAGLSP AQVSRPFGTG TLVETGTPYV SQSETDSAAK SLATDVTSSF
AELEVAVRGT PQLLPDKSVT LTKAGTPFDG AYTVTGVRHL FEHGTYETWV SMTGRQFRSL
YGLASGGAHG PGGTGQRMSG VVSGIVTDIH DPLRMGRVKL RFPWLDDDYV SDWARTVQHG
GVSVHGGLEP AHGADHIPAG SPGSGGFIGY AVNDEVLVTF DRGDFDQPYV IGGLYNGVNK
PTRFTEDNLV SKDGIPNVLA VSSRRGNRLE LLDDELGMKA GVKVLTSDEK QSIELDKMTK
TSTVKNSVGP IMVESNAPDG RVTIRSGAAS ITLTAEGTVN IEGATEVSVS AGAMLSLKAA
ELNLAAPVTT VESAEINFAG ASFSVEAAEI TLTGNVAIVG EGTIDAQQIV VI
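
The sequences above are printed in space-separated blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, translated, and compared with the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third
# position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate whole codons from the first base; a trailing partial codon is ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the listing above.
first_gene_line = "ATGACCTTCG GCTCCTTCTC CTCGGTCCCC AAGGTCGAGA TCGGCGGCAC GCTGCCCCGG"
last_gene_line = "GAGGGCACGA TCGACGCCCA GCAGATCGTG GTGATCTGA"

print(translate(first_gene_line))  # prints: MTFGSFSSVPKVEIGGTLPR
print(translate(last_gene_line))   # prints: EGTIDAQQIVVI*
```

The two translated fragments match the start and end of the listed protein sequence, with the final `*` corresponding to the TGA stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.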