Gene Cag_1857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1857 
Symbol 
ID3747010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2365057 
End bp2367453 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content46% 
IMG OID637774395 
Producthypothetical protein 
Protein accessionYP_380151 
Protein GI78189813 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC AGATTTTGTT GTTCTCAGCC ATTTTTAGCG GCTATACCTT CTTGCTCCTT 
TTGCTTTACT TTCCGTTAGT CTTTCAATCG CAAGTGCTTA CAGCCCCCGA TTCGCTTATC
CCACAAGCCT CATCCATGGC GCTTGATAAA CTCCAAGCTG AAAGCGGAAG CTATCCGTTG
TGGCAGCCGT GGATTTTTTC AGGAATGCCA ACCGTTGAAG CCTTTAGCTA TCTCAGCGGG
CTTTACTATC CCAATTTGTT GTTCAATCTT TTCCATACTG ATGGTGTGCT TCTCCAGCTT
CTTCATCTTG CTTTTGCGGG TGCAGGTACC TTCCTTTTGT TGCGCGATTT ACGTCTCTCC
TTGCTTGCCT CCATTGCTGG GGGATTGATC TTTCTCTGTA ATCCCTTTTT TAGCGCCATG
CTTGTGCATG GGCATGGTAG TCAGCTTATG ACCACTGCAT ATATGCCGTG GATGCTATGG
GCAGCAATGC GCTTTATGGA TCGTGGTGGC GTTGCTGAAG CGGGCATTTT TGCACTGATT
GCAGGCTTGC AATTGCAGCG AGCGCATGTG CAAATGGCTT ACTATTCGTG GCTTATGATG
CTGCTTTTGG TGGTTGTGTT GTTTGCAACA CGTCGTTGGG TTGTGCCACA AGCTGTGCAA
CGGGGTGGGC TTTTCGTAAT TGCTTCAGTA ACGGCAATTG CTATGGCTGC CGCAATTTAT
TTACCAGCCT CGCACTATGC TGAAGCCTCC GTGCGTGGTG CGGCAGTAGG TGGTGGTGGC
GCTGCGTGGG AATATGCAAC GCTTTGGTCG CTTCATCCAC TTGAAGCAAT AACCTTCCTT
TTCCCGGGAT TTTTTGGTTT TGGTGGTGTT ACCTATTGGG GCTTTATGCC CTTTACCGAC
TTTCCACATT ACGCAGGTCT TGTGGTGTTA CTGCTTGCTC TCATGGGGCT AATTATGCGC
CGTCGTGAAC CTATGACGTG GCTTTTTGCG GGCGTTGGCT TCCTTGCTCT TTTGCTTGCT
TTCGGACGAT TTTTTAGCCC AATCTTCGAC CTCTTTTATT CGTTTGCACC GCTTTTTAGC
CGCTTTCGTG TTCCTTCAAT GGCGCTCATT ATGCTCTATT TTGCGCTTGC TGCACTTGCG
GCTATTGGTT TGCATGAATT GCTTGAGCGT AAACCACAGC GCTTGCTCAA AGTGCTTCGA
CTTAGCAGTA TAGTTGTAGC GCTTCTATTG TTGATTTTTT TAGCCTTAGA AGAGGTTGCT
GAACATGCAG CACGTTCACT TTTTCCGCTC CCGCAAGTTG ATAGCTTTGA GTTAGTCTCT
GCCATTAATT CTATACGATG GGAACAACTT TCAAGCAGTG TTATTGTAAC TCTTACTCTT
TTACTGCTTG TAGCAGGTGT TTTGTGGCTT TTACTGAGTG GCAAAATTTC TTCAAAATAT
TCGGCATCGC TCCTTGTGTT GCTTGCTGTG GGTGATCTGC TGTGGGTTAC GGTTCAAGTT
ATTTATCCAT CAGCTCACTC ACTTCGTACT CCGCTTTTTG CCGATAAGCA ACAAGTTGCA
CCAGCATTTC AGCATGATGA TGTTACCCGT TTTCTTGCAA GTCAGCCCAA ACCCTTTCGT
ATCTATCCTG CGGGTAACTT CTTTACAGAA AATAAGTTTG CCCTTTTTGG AATTGAATCG
GTGGGTGGTT ACCATCCTGC CAAGCTAAAA AGCTACGATG ATCTATTGCA GGTGAGCGAT
AATCTTGCAA GTATTGCCCT CTTGCGTATG CTTAATGTTC ACTACATCGT TAGCCCCGCA
CCAATTGAGC ATCCCACACT TACATTGGCA ACAAGCGGTA CCTTGCAGCG TGCGAATGGT
TCAGCTCAAG CCTTTGTTTA TCGCTTGCAA GAGCCAGCAC CACGCGCATG GTTTGTCAGC
CGTGTTGTGC CATTTTCCAA CAAGCAAGAG CTATACAGCC ATTTGCTTGA TGATACTGCT
TCGCTTTCAG TGGCTTACGT TGAAGCGCAG CAATGGCAAG GCGCACAACG TTTTTCAGAA
GGCACCATTC AATCCGTTAC CACACAACCC GAATCCATTA AGCTTAATGT TAACGCACCA
AATAGTTCCT TTTTAGTGCT CAGTGAAATC TACTATCCCA ACGGTTGGCA GGTAATGCTT
GATGGTAAAG CAACTTCCAT GCTTCGGGTT AATGGCGTGT TGCGAGGGGT TAACGTACCG
GCAGGTAACC ATGCTATCCA CTTCAGTTAC AATCGCCATT TATTTGAGCA AAGCCAATGG
ATTGCTCTTG CGGGATTTAT TATTGCACTG CTGATGATTG CGGGTGGCTT GCTTTGGAAG
CATCTTCTTC TTTCAGGTGA AAAACGGGTT GTAAGAGGTT TTCATACAAT AAGATAA
 
Protein sequence
MKKQILLFSA IFSGYTFLLL LLYFPLVFQS QVLTAPDSLI PQASSMALDK LQAESGSYPL 
WQPWIFSGMP TVEAFSYLSG LYYPNLLFNL FHTDGVLLQL LHLAFAGAGT FLLLRDLRLS
LLASIAGGLI FLCNPFFSAM LVHGHGSQLM TTAYMPWMLW AAMRFMDRGG VAEAGIFALI
AGLQLQRAHV QMAYYSWLMM LLLVVVLFAT RRWVVPQAVQ RGGLFVIASV TAIAMAAAIY
LPASHYAEAS VRGAAVGGGG AAWEYATLWS LHPLEAITFL FPGFFGFGGV TYWGFMPFTD
FPHYAGLVVL LLALMGLIMR RREPMTWLFA GVGFLALLLA FGRFFSPIFD LFYSFAPLFS
RFRVPSMALI MLYFALAALA AIGLHELLER KPQRLLKVLR LSSIVVALLL LIFLALEEVA
EHAARSLFPL PQVDSFELVS AINSIRWEQL SSSVIVTLTL LLLVAGVLWL LLSGKISSKY
SASLLVLLAV GDLLWVTVQV IYPSAHSLRT PLFADKQQVA PAFQHDDVTR FLASQPKPFR
IYPAGNFFTE NKFALFGIES VGGYHPAKLK SYDDLLQVSD NLASIALLRM LNVHYIVSPA
PIEHPTLTLA TSGTLQRANG SAQAFVYRLQ EPAPRAWFVS RVVPFSNKQE LYSHLLDDTA
SLSVAYVEAQ QWQGAQRFSE GTIQSVTTQP ESIKLNVNAP NSSFLVLSEI YYPNGWQVML
DGKATSMLRV NGVLRGVNVP AGNHAIHFSY NRHLFEQSQW IALAGFIIAL LMIAGGLLWK
HLLLSGEKRV VRGFHTIR