Gene Caci_3199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3199 
Symbol 
ID8334552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3526968 
End bp3528266 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content70% 
IMG OID644956344 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003113947 
Protein GI256392383 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0389134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCG ATCGTCTTAC TGCTGCGCAT GCTGGGCCGT CGGCTTCGGG AGGGCTGACG 
GCGGCGGGTA AACGGTTCCG GTTCTGGGGG ACGGCCTACA CCTTCCTGAT CCTGCTGACC
GGGACGAATC TGCCGACACC CCTGTACAAG GGCTACGAGG CGCGGTTCGG GTTCTCGCCG
CTGACGCTGA CGCTGATCTT CACCGCCTAT GTCGCGGTCC TGATTCCCTC GCTGTTGGTG
GTCGGGCCGG CTGCCGACGC GATCGGGTAT CGCGTCATGC TGCTGCCTGC GCTGTTCGTG
GCGGCGGGTG GGGCGCTGGT GTTCGCGTTC GCGTCCGGAA CCGGGTGGCT GTTCGCGGGG
CGCATCTTGC AGGGCGTCGC GATCGGGGCG GCGACCGGAC CGCTGACGGC GACGCTGACC
GAGCTCGAAC CGCACGGCGA CCGGCGCAAG GCGGCGTTGG TCTCGACGGT GGCCACGGCC
GGAGGACTCG GACTCGGGCC GCTGCTGGCG GGTTTCCTCG CGCAGTACGC GCCGGCGCCG
CGCGTGCTGC CGTTCGCGCT GGAGATCGGG CTGCTGGTCC CGGCGGTGGC ACTGGTGCTG
ACGCTGCCGG CGAACCGTGC CCGCACGCGG TGGCGCCCGC GCCGTCCGGA GATCCCGGCC
GCCCTGCGTT CGGAGTTCGC GACGAGCGGG ACGGCGTGCT TCGCAGCGTT CGCGGTGGTG
GGACTGTTCC TGACGCTGAT TCCGACCTAC GTCGCGACGC TGTCCGGGAG CAAGAACCTG
CTCCTCGGCG GCGCGGCGGT GGCGCTGATG CTGGCGTGCT CGGCGATCGC GCAGGTAGTC
GGGTACGGGA AATCGGCGCG CGGGCTGGAG ATCGCAGGGC TTCCGCTACT GGCGGTGGGG
CTGGTGTCGC TGGCGATAGC AGGGAACGTG TCGTCGCTGG CGCTGTTGCT CGGCGCGACA
GTCGTAGCCG GAGCGGGGCA GGGACTGACG TTCCTCGGCG GGCTGACGGC GATCAACGCG
GTGGCGCCGG CGGATCGGCG AGCCGATGTG CTGTCGAGCT TCTTCGTGAT CCTCTATTTG
GGCGTCGGCG TGCCGGTGGT GGGAGTGGGC TTCGTCGCGA CGCAGGTGGG TTTGCTGGCG
GCGGTTCAGT ACTTCGCGTG GGGTGCGGCG GTGTTGTGCG TGGTGGTGCT GGCGGTGCTG
GGGCGCAGAC GTACACGCGA AGGGGATGGG AAAACGGGGC TGGCGGTCGA GGCAGCTGAG
GCTGGAACCG CCGCGCGGCG TCGGTTGACC GACCGGTAG
 
Protein sequence
MALDRLTAAH AGPSASGGLT AAGKRFRFWG TAYTFLILLT GTNLPTPLYK GYEARFGFSP 
LTLTLIFTAY VAVLIPSLLV VGPAADAIGY RVMLLPALFV AAGGALVFAF ASGTGWLFAG
RILQGVAIGA ATGPLTATLT ELEPHGDRRK AALVSTVATA GGLGLGPLLA GFLAQYAPAP
RVLPFALEIG LLVPAVALVL TLPANRARTR WRPRRPEIPA ALRSEFATSG TACFAAFAVV
GLFLTLIPTY VATLSGSKNL LLGGAAVALM LACSAIAQVV GYGKSARGLE IAGLPLLAVG
LVSLAIAGNV SSLALLLGAT VVAGAGQGLT FLGGLTAINA VAPADRRADV LSSFFVILYL
GVGVPVVGVG FVATQVGLLA AVQYFAWGAA VLCVVVLAVL GRRRTREGDG KTGLAVEAAE
AGTAARRRLT DR