Gene Caci_1359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1359 
Symbol 
ID8332697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1546758 
End bp1548476 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content68% 
IMG OID644954507 
ProductNa+/solute symporter 
Protein accessionYP_003112123 
Protein GI256390559 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCACT CGACCGCCTC AAAGGTCCCG ACTCCTGACG TCGGCACCCA CGGCATCAAC 
ACGACCGAGC TGGTCGTCGT CGTCTTCTTC TTCCTGCTGG TCTCGGTCCT CGGCTTCCTG
GCGGCGCGCT GGCGTAAGGC AAGCCAGGCC AACCACCTGG AGGAGTGGGG TCTGGGCGGC
CGCAGCTTCG GCGGCTGGAT CACCTGGTTC CTGCTCGGCG GCGACCTGTA CACCGCGTAC
ACGTTCGTCG CGGTGCCGGC GGCGCTGACC GCGGGGGCGT TCGGCTTCTT CGCCGTGCCG
TACACGATTA TCGTGTGGCC GCTGGTGTTC CTGTTCCTGC CGCGGCTGTG GTCGGTCTCG
CGCAAACGCG GGTACGTCAC ACCCGCGGAC TTCGCACGCG GGCGGTACGG CTCGAAATCG
CTGGGACTCG CGGTCGCCAT CGTCGGCGTG GTGGCGACGA TGCCGTACAT CGCGCTGCAG
CTGGTCGGCA TCCAGGCCTG CCTGGACGTG ATCGGCATCG GCGGCAAGGG CTCCTCGACG
TTCGCCAAGG ACCTGCCGCT GTTCATCGCC TTCGCGGTGC TGGCGGCGTT CACCTACACC
TCGGGGCTGC GTGCGCCGGC GGTGATCGCG TTCGTCAAGG ACTTCCTGAT CTACCTGGTC
ATCATCGTCG CGATCATCTA CATCCCGACC CGGATCGCCG GCGGCTGGCA CGGCATCTTC
CACCTGGCGT CCACCGGCGC CAACGGCAAG ACCAAGCCGG GCTTCCTGTC TCTGAACGAC
CACAACGGCA AGACCAACGC CTTCCCGTAC GCCACGCTGG CGCTCGGCTC GGCGATGGCG
CTGTTCATGT ACCCGCACGC GCAGATCGGC GTGCTGTCCA CCAAGTCCCG CAACACGGTC
CGCAAGAACC TCGCGGGCCT GTCGCTGTAC TCGCTGGTGC TCGGCTTCAT CGCGCTGCTG
GGCTACATGG CGCTGGCCGT CGGCTTCAAC GGCACCGCCA AGGGCCACCT CGGCGGCAAC
GCGCAGCGCG CGGTCCCGGC GCTGTTCGAC GCGGTGTTCC CGAGCTGGTT CGCCGGCGTC
GCCTTCGCCG CGGTGGCGAT CGGCGCGCTG GTCCCGGCGG CGATCATGTC GATCGCCGCG
GCGAACCTGT TCACCCGCAA CATCTTCGTG GACTTCATCA AGCCCGACGC CACCCCGCAC
CAGCAGGCAC AGGTGTCCAA GACGGTGTCG CTCCTGGTGA AGTTCGGCGC CCTGGCCTTC
GTCCTGGGCC TGGACGCGAC CAGCGCGATC AACTTCCAGC TGCTCGGCGG CGTCCTGATC
CTGCAGACGT TCCCCGCGAT CGTCGTGGGC CTGTTCAACC GCTGGTTCCA CCGCTGGGCC
CTGGTCGCGG GCCTGTGGAC CGGCGTGATC TACGGCGTCG TGGTCGCCTG GCAGCAGAAG
AAGTTCGCCG CCGACGGCAA AACCGTCACC CAGCACCACT TCGGCAGCCA GATCGCCAAG
GTCCCCGGCA CGCATCTGTT CTCCTACATC GCCATCACCG CCCTGGTGGT GAACCTGATC
GTCGCGGTGG TCCTGACCCT GGTCTTCCGC GCCCTGAAGG TCGCCGACGG CATCGACGAG
ACCCAGTCCG CCGACTACAC CGCCGACGAG GGCGACGCGG ACCTGCCGAA CCTGATCGAG
GCAGAGCCGG CGCTCGCGGA GATCGCGGAG TCGAGCTAG
 
Protein sequence
MPHSTASKVP TPDVGTHGIN TTELVVVVFF FLLVSVLGFL AARWRKASQA NHLEEWGLGG 
RSFGGWITWF LLGGDLYTAY TFVAVPAALT AGAFGFFAVP YTIIVWPLVF LFLPRLWSVS
RKRGYVTPAD FARGRYGSKS LGLAVAIVGV VATMPYIALQ LVGIQACLDV IGIGGKGSST
FAKDLPLFIA FAVLAAFTYT SGLRAPAVIA FVKDFLIYLV IIVAIIYIPT RIAGGWHGIF
HLASTGANGK TKPGFLSLND HNGKTNAFPY ATLALGSAMA LFMYPHAQIG VLSTKSRNTV
RKNLAGLSLY SLVLGFIALL GYMALAVGFN GTAKGHLGGN AQRAVPALFD AVFPSWFAGV
AFAAVAIGAL VPAAIMSIAA ANLFTRNIFV DFIKPDATPH QQAQVSKTVS LLVKFGALAF
VLGLDATSAI NFQLLGGVLI LQTFPAIVVG LFNRWFHRWA LVAGLWTGVI YGVVVAWQQK
KFAADGKTVT QHHFGSQIAK VPGTHLFSYI AITALVVNLI VAVVLTLVFR ALKVADGIDE
TQSADYTADE GDADLPNLIE AEPALAEIAE SS