Gene Caci_6695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6695 
Symbol 
ID8338059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7713514 
End bp7715004 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content71% 
IMG OID644959789 
ProductMonosaccharide-transporting ATPase 
Protein accessionYP_003117382 
Protein GI256395818 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4214] ABC-type xylose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.159746 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.276743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA ACCCGAACCC CGGCGACCCG GGCGGCCCGG GCGGCTCCGA GCACGGCGAG 
GACGCCGCGC CGCTCGGCCT GTCCGGCGCC GGCGACACCG CCGACGAGCG CGGGGCACCG
GTCGGCAGCC CCCAGTCGGT CACCGGCAAG GTCGAGGAAC TCCTGGCCGC CGGCGCCACC
CCGGCGGAGG CCACGGCGCA GGCCGCGAAG GCGACCGGGG AGTCCAGGGA GGTCGTCCGG
GAGATCGTCG AGACGATGGT CCCGGCGGCC GGTTCGGCTG CCGACCCGCG GCTGTTGCAG
CAACAGGCCG GCCTGGCCGG CTACTGGGCG GCGTTCGTGC GCCGGCTCAA GGGCGGCGAG
CTCGGCTCGC TGCCGGTGGT CGCGGCGCTG ATCATCATCT GGATCGTGTT CTACGCCCTG
AACAGCACAT TCCTGTCGGC GCAGAACCTG TCCAACCTCT CCCAGCAGAT CGTCGGCACC
GGGATGATCG CCCTGGGCAT CGTCTTCGTG CTGCTGCTCG GCGAGATCGA CCTGGCGGCG
GGCTCGGTGT CGGGTCTGGC GGCCGCGGTG TTCGCCGTGG AGTCGGTGAA CAACGGGGTC
AACCAGTATC TGGCGCTGCT GCTGGCCCTG GCCACCGGCG CCGGGACCGG GTTCGTCCAC
GGCTTCTTCT TCGCCCGGAT CGGCGTGCCG GCGTTCGTCG TCACCCTGGC CGGCAACCTG
GGCTGGAACG GCCTGATGCT CAACATTCTG GGCTCCACCG GCACCGTCAA CCTGCCCAAC
AACGGCATCG TCTCCAAGCT CTACAACACG ATCTACGGCC AACTCGCCGC GGCGTACGGC
GCCGCGATCA TCGCGGTCGT GCTCTACGCG CTGGTGGCCC TGTACGGCCG GGCGCGCCGG
GTCAGGGCCG GGATCCCGGC GCCGCCGATC GGCGAGATCG CGGCGCGGGT GGTCCTGCTG
GCGATCGTCG CCTTCCTCAC GGCCTACGTG TTCAACCAGT ACAAGGGCCT GCCGCTGGCG
CCGCTGATCT TCCTGATCTT CATCGTGGTC GGCGACTTCA TCCTGCGCCG CACGGTCTAC
GGCCGCCGCA TCTTCGCCGT CGGCGGCAAC ATCGAGGCCG CCCGGCGGGC CGGTATCAGC
GTGCCGTTCA TCCGGCTCAC GGTCTTCATG ATCAGCGGCC TGATGGCCGC GGTCGGCGGT
CTGTTCCTGG CCGGCCAGAT CGAGTCCGCC TCCCAGACCT CCGGCGGCGG CAACCTGCTG
CTGAACGCGA TCGCCGCGGC GGTCATCGGC GGCACGAGCC TGTTCGGCGG ACGCGGCAAG
ACCTGGTCGG CGCTGCTCGG TGCGCTGGTC ATCGGCTCGA TCCAGTCCGG CATGAACATC
CAGGGCCTGT CGAACAGCAT CCAGTTCATG GTCACCGGCG CCGTGCTGCT GGCCGCGGTG
GTCATCGACT CCGTGGCGCG GCGGACGCAG AAGGCGAGCG GTCGCGTTTA G
 
Protein sequence
MSQNPNPGDP GGPGGSEHGE DAAPLGLSGA GDTADERGAP VGSPQSVTGK VEELLAAGAT 
PAEATAQAAK ATGESREVVR EIVETMVPAA GSAADPRLLQ QQAGLAGYWA AFVRRLKGGE
LGSLPVVAAL IIIWIVFYAL NSTFLSAQNL SNLSQQIVGT GMIALGIVFV LLLGEIDLAA
GSVSGLAAAV FAVESVNNGV NQYLALLLAL ATGAGTGFVH GFFFARIGVP AFVVTLAGNL
GWNGLMLNIL GSTGTVNLPN NGIVSKLYNT IYGQLAAAYG AAIIAVVLYA LVALYGRARR
VRAGIPAPPI GEIAARVVLL AIVAFLTAYV FNQYKGLPLA PLIFLIFIVV GDFILRRTVY
GRRIFAVGGN IEAARRAGIS VPFIRLTVFM ISGLMAAVGG LFLAGQIESA SQTSGGGNLL
LNAIAAAVIG GTSLFGGRGK TWSALLGALV IGSIQSGMNI QGLSNSIQFM VTGAVLLAAV
VIDSVARRTQ KASGRV