Gene Cag_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1541 
Symbol 
ID3746600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2020049 
End bp2021623 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content46% 
IMG OID637774081 
Productphosphodiesterase 
Protein accessionYP_379839 
Protein GI78189501 
COG category[R] General function prediction only 
COG ID[COG1418] Predicted HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR03319] conserved hypothetical protein YmdA/YtgF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00592062 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGATTG TTATAAACCT CTTATTGCTT GTATTAGCGG CTCTTGTGGC GTTTGTTGCA 
GGCTTTTTTA TTGGGCGCTA CTTTCTTGAG CGCATTGGTA CTACAAAGGT TTTAGAGGCT
GAAGAACGAG CGGTGCAAAT TGTGCAGGAA GCTCAAAAAG AGGCAAATGA GTACAAGGAA
TTAAAGGTTA GCGAAGTTAA TCAGGAGTGG AAAAAAAAGC GTCGTGAGTT TGAGCAAGAT
GTGCTTATTA AGAACAACAA ATTTGCACAG TTACAAAAGC AGTTGCAGCA ACGCGAAGCG
CAACTGAAAA AGCAATCGCA AGATGTGCGC GATGCTGAGC GCAAATTGCA AGATCAGCGC
AAAGAAGTAG AGCAGTTAAG TGATTCGGTG AAGCTTCGTG CTACTGAGCT TGAGCGCGTT
ATTGTGGAGC AAAATCAACG TCTCGAAAGC ATTAGCAATC TGCAAGCTGA TGAGGCTCGC
CAAATGCTTA TTGATAATAT GGTTACACAA GCACGCGAAG AAGCAAGCAA CACCATTCAC
CGCATTCACG AAGAGGCTGA GCAGCAAGCC ACGCGCATGG CAGAAAAAAC CCTCATTACG
GCTATCCAGC GCATCTCTTT TGAGCAAACC ACTGAAAATG CTCTTTCGGT AGTTCACATT
CAAAGTGATG AATTAAAAGG GCGCATTATT GGTCGTGAAG GGCGCAACAT TAAAGCTTTT
GAAAATGCTA CTGGGGTTGA CATTATTGTT GACGATACCC CCGAAGTGGT TATTCTCTCC
TGCTTTGATC CCTTGCGCCG AGAGCTGGCA AAACTCACCC TTAAAAAATT GCTTGCCGAT
GGCATTATTC ATCCCGTAGC TATTGAAAAA GCTTATGCGG ATGCTACCAA AGAGATTGAC
GATGTTGTCT ATAGTGCGGG CGAAGAGGTG GCGGCATCGC TCCAACTTAA CGACATTCCC
ACCGAAGTGA TTGCGCTTCT TGGCAAAATG AAGTTCCACA CCGTGTATGG GCAGAACTTG
CTACAACATA GCCGTGAAGT AGCAATGCTT GCAGGCGTTA TGGCGGCAGA GTTAAAGCTT
GATGCACGTA TGGCAAAACG GGCAGGTTTA TTGCACGATA TTGGCTTAGT GCTGCCCGAA
AGCGATGAGC CACATGCAAT TACGGGCATG AATTTTATGA AGAAATTTAA TGAGTCAGAC
CAACTGCTTA ACGCTATTGG CGCTCACCAT GGTGATATGG AAAAAGAGTC GCCACTTGCC
GATTTAGTTG ATGCCGCCAA CACCATTTCG CTTTCACGTC CCGGTGCGCG TGGTGCCGTA
ACGGCTGATG GCAACGTTAA ACGCCTTGAA AGCCTTGAAG AAATTGCAAA GGGCTTCCCT
GGAGTGTTAA AGACCTATGC GTTACAAGCA GGGCGCGAAA TTCGTGTGAT TGTGGAAGGC
GATAACGTCA GCGATTCGCA AGCCGATATG CTTGCCCACG ATATTGCTCG TAAAATTGAG
TCGGAAGCGC AATATCCCGG TCAAATTAAA GTTTCCATTA TTCGCGAAAA GCGTTCAGTG
GCTTACGCCA AGTAA
 
Protein sequence
MGIVINLLLL VLAALVAFVA GFFIGRYFLE RIGTTKVLEA EERAVQIVQE AQKEANEYKE 
LKVSEVNQEW KKKRREFEQD VLIKNNKFAQ LQKQLQQREA QLKKQSQDVR DAERKLQDQR
KEVEQLSDSV KLRATELERV IVEQNQRLES ISNLQADEAR QMLIDNMVTQ AREEASNTIH
RIHEEAEQQA TRMAEKTLIT AIQRISFEQT TENALSVVHI QSDELKGRII GREGRNIKAF
ENATGVDIIV DDTPEVVILS CFDPLRRELA KLTLKKLLAD GIIHPVAIEK AYADATKEID
DVVYSAGEEV AASLQLNDIP TEVIALLGKM KFHTVYGQNL LQHSREVAML AGVMAAELKL
DARMAKRAGL LHDIGLVLPE SDEPHAITGM NFMKKFNESD QLLNAIGAHH GDMEKESPLA
DLVDAANTIS LSRPGARGAV TADGNVKRLE SLEEIAKGFP GVLKTYALQA GREIRVIVEG
DNVSDSQADM LAHDIARKIE SEAQYPGQIK VSIIREKRSV AYAK