Gene Cag_0843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0843 
Symbol 
ID3746802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1171771 
End bp1175148 
Gene Length3378 bp 
Protein Length1125 aa 
Translation table11 
GC content51% 
IMG OID637773372 
Producthypothetical protein 
Protein accessionYP_379151 
Protein GI78188813 
COG category[S] Function unknown 
COG ID[COG4913] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAGC CGCTTGAAAT GGGATTTGTT GCCGATGACC GCTTGGCTGG TTTTCGACTG 
CAACGGCTTG AAATATTCAA TTGGGGCACC TTTGATGGCA GGGTTTGGAC GCTCAAATTG
GGTGGAAAAA ATGGTTTGCT TACGGGCGAT ATTGGCTCTG GAAAATCAAC CGTTGTTGAT
GCTGTAACCA CACTGTTAGT GCCAAGCCAG CGCATTGCTT ACAACAAGGC GGCAGGTGCC
GATAACAAAG AACGCACCTT GCGCTCTTAC GTGCTGGGCT ACTATAAATC GGAGCGGCAG
GAGAACCTTG GCGGTGGTGC AAAACCCGTT GCTCTGCGCG ACCTCAACAG CTATTCGGTG
ATTCTCGGTG TTTTTCATAA CGAAGGTTAC GACAAAACGG TAACGTTGGC GCAACTCTTT
TGGATGAAGG ATGCTGCTCA ACCTGCCCGC CTTTATGCTG CGTATGAAGG CGACCTCTCC
ATTGCCACCG ATTTTTCTAA CTTTGGTGCT GAAATCGCGA CGCTGCGCAA GCGCCTTCGT
GGTGCGGGAG TTGAGCTATT TGAGAGCTTT CCACCCTATG GCGCATGGTT TCGGCGACGC
TTCGGCATTG AGAACGAGCA AGCATTAGAG CTGTTTCATC AAACCGTTTC GTTAAAATCT
GTAGGAAACT TAACGGACTT TGTGCGCCTC CACATGCTTG AACCCTTTAC GGTTGAGCCA
CGTATTGCCG CCCTTATCCA CCATTTTGAA GATCTTAATC GGGCGCACGA AGCGGTGCTC
AAGGCAAAGC GGCAAATTGA GATGCTTGCA CCTTTGGTGG CAGATTGCGA CAATCACCAC
GCTATGGCGC AAAGAACAGA AGAGCTGCGG GCATCGCGCG ACTGCCTTCG CCCTTGGTTC
GCTTCCCTTA AACTTGAATT GCTCGAAAAA CGCCATACTT CGCTTAATGA GGAGCTAAGC
CGCCACCAGA TCGCCATTGA GCGTCTGGAC GGGGAGCGCC GCACCTTGCA GGGGCGCGAT
CGTGAGCTGC GCCGAACTAT TGCTGAAAAC GGGGGCGATC GAATTGAGAG TATTGCGGCT
GAGATTCGGC AGCATCAGCA AGAGCTGGAT CGGCGCACCC AAAAATCAAC CCGATATAAA
GAACTTTTGA GGCAGCTTGG GGGGCAGCTT GGCGAGCATC CCGCAACAAG CGCTGAGGAG
TTTTATCAGC AGCGAGCAGA ACATGCCGCC ATGCACGAAA GTGCGGCTGA AGCTGAAGTA
CAGGTTCAAA ACAATCTGAA CGAAGCAGGT GTGCTCTTTA CCCAAGGGCG TCAGGAGTAT
GAGCAACTGA GCACCGAAAT CAAGAGCTTG AAAGCACGCA TGAGCAACAT TGATGAAAAG
CAAATTGCCA TGCGCCATGC GCTCTGCAAG GCGTTGAATT TGCCTGAGGT GGAGATGCCT
TTTGCGGGCG AATTGCTGCA AGTTCGTGAA GAGGAAACAG CTTGGGAGGG AGCTATTGAA
CGGGTGTTGC GCAACTTTGG GTTGTCGCTT TTAGTGCCCG ACCACCATTA CCCAAAAGTG
GCGGAATGGG TGGAGCGCAC CAATCTGAAA GGCAAACTTG TCTATTTTCG AGTTCGTCCG
CAATCTCGCA ATGAGCAGTT GGCGGATCAT CCCGCTTCAC TTGCCCGCAA GCTTGCCATC
AAAGCCGATT CCACCTATTA CGATTGGATT GAACGGGAAG TTTCCCATCG TTTTGATCTG
ATTTGCTGCA CCACACAAGA GGAGTTTCGG CGGGAGAAAA AAGCCATCAC GCCAGCGGGA
CAAATTAAGT CGCCGGGTGA ACGGCACGAA AAGGATGATC GCCATCGTCT TGACGACCGT
AGCCGCTACG TGCTTGGGTG GAGCAATGCC GCTAAAATTG CTGCCCTTGA AGCAAAGGCG
AAGGTGCAGC AACGTGAGCT TACCAAACTT GCCGAACGCA TCAGCACGTT GCAACAAGAG
CAAAAAGCGC TTAAAGAGCG GCTAACCATT CTCTCCAAGC TTGATGAATA TTCCGATTTC
AACGACCTCG ATTGGCAGCC ATTGGCAGTT GCTATTGCGC GATTAGAAAA GGAAAAAGAG
GAGCTTGAAA AAACCTCCAA TATTCTGCAA ACCCTTACCG AGCAGCTTGC TTTGCTGGAA
CAAGAGCTGC AAAAAACGGA GCAGCAGCTT GACGACCGAA AGGATAAACG CTCTAAAATT
GAGCAGAAAA TTAGCAGCAT CACTGAGTTG CAGCAACAAA CGGCGGCACT GCTTGCAGAA
GCAGGAGACG AAGTAAGCAA CCGTTTTGCC CTCCTTCAAG CAATGCGCCA AGAGGCTTTT
GGCGATCAAT CGCTAACGGT TGAATCGTGT GACAATCGCG AGCGGGAGAT GCGTGAATGG
TTGCAGAATA AAATTGACAG CGAAGATAGA AAGCTCTCTC GATTGGGTGA AAAAATCATT
CGGGCAATGA CCGAATACAA GGAGGAGTGG AAACTTGAAA CCCGCGACGT GGATGTTAAT
ATAGCTGCCG GTAAAGAGTA TCGTGCCATG TTTGAGCAAC TTCAGGCTGA CGATTTGCCT
CGCTTTGAGG GGCGCTTTAA AGAGCTGCTG AATGAGAATA CCATCCGCGA AGTTGCCAAT
TTTCAGTCGC AACTTGCCCG CGAACGCGAA ACCATCAAAG AGCGTATTGT TCGCCTCAAT
GAATCGCTAA CACAAATTGA TTTTAATCCT GGGCGATACA TCACCCTTGA AGCTGAGAAT
AGCCTTGATG CCGACATCCG TGATTTTCAA ACGGAGCTTC GTGCTTGCAC CGAAGGAACG
CTGACAGGCT CGGATGATGC GCAATATTCG GAAGCCAAAT TCCTGCAAGT ACGGCGCATT
ATTGATCGCT TTCAAGGGCG CGAAGCTTAT GCTGACCTCG ACCGTCGCTG GACAGCCAAA
GTAACGGATG TGCGCAACTG GTTTGTTTTT GCCGCCAGCG AACGGTGGCG CGAAGATGAC
ATTGAGCACG AACACTACGC CGATTCAGGT GGTAAATCGG GAGGGCAAAA GGAGAAACTC
GCCTACACCG TGCTTGCCGC CAGCCTTGCC TATCAATTCG GCTTGGAATG GGGCGCCGTG
CGCTCCCGCT CTTTCCGCTT CGTGGTGATT GACGAAGCTT TCGGACGCGG CTCCGACGAA
TCGGCACAAT ATGGATTGCA ACTCTTCGCC CAACTTAACC TGCAACTGCT CATTGTTACG
CCATTGCAGA AAATCCACAT TATTGAACCC TTTGTTGCCA GCGTTGGCTT TGTGCACAAT
CAAGAAGGGC GCTGCTCAGT GTTACGCAAC CTCACCATCG AAGAGTATCG CTCCGAAAAA
GAGAGGGCAA TTGCATGA
 
Protein sequence
MSEPLEMGFV ADDRLAGFRL QRLEIFNWGT FDGRVWTLKL GGKNGLLTGD IGSGKSTVVD 
AVTTLLVPSQ RIAYNKAAGA DNKERTLRSY VLGYYKSERQ ENLGGGAKPV ALRDLNSYSV
ILGVFHNEGY DKTVTLAQLF WMKDAAQPAR LYAAYEGDLS IATDFSNFGA EIATLRKRLR
GAGVELFESF PPYGAWFRRR FGIENEQALE LFHQTVSLKS VGNLTDFVRL HMLEPFTVEP
RIAALIHHFE DLNRAHEAVL KAKRQIEMLA PLVADCDNHH AMAQRTEELR ASRDCLRPWF
ASLKLELLEK RHTSLNEELS RHQIAIERLD GERRTLQGRD RELRRTIAEN GGDRIESIAA
EIRQHQQELD RRTQKSTRYK ELLRQLGGQL GEHPATSAEE FYQQRAEHAA MHESAAEAEV
QVQNNLNEAG VLFTQGRQEY EQLSTEIKSL KARMSNIDEK QIAMRHALCK ALNLPEVEMP
FAGELLQVRE EETAWEGAIE RVLRNFGLSL LVPDHHYPKV AEWVERTNLK GKLVYFRVRP
QSRNEQLADH PASLARKLAI KADSTYYDWI EREVSHRFDL ICCTTQEEFR REKKAITPAG
QIKSPGERHE KDDRHRLDDR SRYVLGWSNA AKIAALEAKA KVQQRELTKL AERISTLQQE
QKALKERLTI LSKLDEYSDF NDLDWQPLAV AIARLEKEKE ELEKTSNILQ TLTEQLALLE
QELQKTEQQL DDRKDKRSKI EQKISSITEL QQQTAALLAE AGDEVSNRFA LLQAMRQEAF
GDQSLTVESC DNREREMREW LQNKIDSEDR KLSRLGEKII RAMTEYKEEW KLETRDVDVN
IAAGKEYRAM FEQLQADDLP RFEGRFKELL NENTIREVAN FQSQLARERE TIKERIVRLN
ESLTQIDFNP GRYITLEAEN SLDADIRDFQ TELRACTEGT LTGSDDAQYS EAKFLQVRRI
IDRFQGREAY ADLDRRWTAK VTDVRNWFVF AASERWREDD IEHEHYADSG GKSGGQKEKL
AYTVLAASLA YQFGLEWGAV RSRSFRFVVI DEAFGRGSDE SAQYGLQLFA QLNLQLLIVT
PLQKIHIIEP FVASVGFVHN QEGRCSVLRN LTIEEYRSEK ERAIA