Gene Cag_1862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1862 
Symbol 
ID3747014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2370424 
End bp2371914 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content49% 
IMG OID637774399 
Productpolysaccharide efflux transporter, putative 
Protein accessionYP_380155 
Protein GI78189817 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGAA ATTCGTTAGT TGCAGGGCAG GCAGGATTTG CCTTTGCGGG ATTGCTCTTT 
GGGCAACTGA TGCGCTTTGG TTATAACCTT GTGGTTGCCC GCTTGCTTGG CGTAGAAGCG
CTTGGCATTT ATGCGCTTGC CATCGCTGTT ATGCAAGTTG CTGAAGTGGT TGCGCTTGCA
GGGTGCGATG CATCACTGCT TCGTTTTGTC AACCTCTACC ACAACGATGC CGCACGTCAA
CGCCAAGTGA TTGGCTTTGC CGCTAAAAGT AGCTTACTCT TCTCGCTTGC TGTTATGGCG
TTGCTGATGC TCTTTGCCAA TCAACTCTCA GCGCTTTTCC ACGGCAATGA ACTGCTGACG
TTGGCACTCT CCTGCTATGC AGCCGCCTTG CCATTCAATG TGTTAACACA GGTTACAGCG
CACGCCTTGC AAGCATTTCA GCACTTAAAG CCGAAAATTA TTGCCACGCA ACTGCTCAGT
CCATTGCTTT TGCTGCTCTT CACCTTGCTT TTTTATTATA CCGTTGGCAT ACAAGCGGCA
TTGCTTATGC CCTTTCTCCT TTCAGCATGT GGCGCATTGC TCTGGATTCT TCTACCATTT
GCCACAACCA CCGGCATTCG CTTTATTGAC ATTGTACGCG CTCGGCACGA TAACGCCATG
CTAACCTATG CCTTGCCACT TATGGCAGTC TCGCTCTTTA GTATGCTAAG CCACTGGCTT
GATGTGATGA TGCTTGGCAT CTTTAGCGAT GCAGTTACCG TTGGATTGTA CCATCCAGCC
GCAAGAACCG CAGGCTTGTT ACGCTCCGTG CTTTTGGCAT TTGCAGGCAT TGCCGCACCG
CTTTTTGCAG AGCTTCACGC ACAAGGCAAC AAAGCCGAAA TGGCTCGTCT CTACAAATTA
GTTACACGCT GGAGCGTTAT CCTCCTTATT CCCCCTCTCT TGATTTTTAT GGTGCTACCG
CAGCAAGTAC TTTCGCTTTT TGGCGCCCAC TTTGCCGATA GCGGAGCTGT AGCCTTGCAA
CTCTTAAGCG CCGCATATTT TGTACAATGC GTTTTTGGCA TTGCCTCCAC CCTGCTTGCT
ATGAGCGGCT ATGCTCAACT CAGCCTCATA AACGCCGTTG TAGCACTTGC CTTACAAGCA
GGCTTAAATT GGCTTTTTAT TCCAACAATG GGATTACAAG GCGCAGCCGT TGCATCGTTA
GTGCTCTTTC TCTTGCTCTC AGCACTTCGA TGGCTGGAAG TTCGCCTCTT ATTGCAGATG
AATCCATTAA GCACCATGTT GTGGAAGCCG CTCGTTGCTG GAGCTGTTAC CTTCTTGCTA
CTCATGCTCA TGCACTCGTG GTTGCTCATG CTGCCATCGT TGCTGGCGCT TGGGGTTGGA
ACCGTTATTG CCTTTAGCTG TTATGTGGCT CTGATGTTGA TGCTGAAGTT GGAAGTGGAT
GAGAAGGAGA TTATTTTCAA GTATCTGCCT TTTATGAGGA AGGATGGATA G
 
Protein sequence
MSRNSLVAGQ AGFAFAGLLF GQLMRFGYNL VVARLLGVEA LGIYALAIAV MQVAEVVALA 
GCDASLLRFV NLYHNDAARQ RQVIGFAAKS SLLFSLAVMA LLMLFANQLS ALFHGNELLT
LALSCYAAAL PFNVLTQVTA HALQAFQHLK PKIIATQLLS PLLLLLFTLL FYYTVGIQAA
LLMPFLLSAC GALLWILLPF ATTTGIRFID IVRARHDNAM LTYALPLMAV SLFSMLSHWL
DVMMLGIFSD AVTVGLYHPA ARTAGLLRSV LLAFAGIAAP LFAELHAQGN KAEMARLYKL
VTRWSVILLI PPLLIFMVLP QQVLSLFGAH FADSGAVALQ LLSAAYFVQC VFGIASTLLA
MSGYAQLSLI NAVVALALQA GLNWLFIPTM GLQGAAVASL VLFLLLSALR WLEVRLLLQM
NPLSTMLWKP LVAGAVTFLL LMLMHSWLLM LPSLLALGVG TVIAFSCYVA LMLMLKLEVD
EKEIIFKYLP FMRKDG