Gene Cag_1722 details


Gene Information

Locus tag: Cag_1722
Symbol:
ID: 3746505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2236557
End bp: 2239598
Gene Length: 3042 bp
Protein Length: 1013 aa
Translation table: 11
GC content: 43%
IMG OID: 637774259
Product: hypothetical protein
Protein accession: YP_380016
Protein GI: 78189678
COG category:
COG ID:
TIGRFAM ID:
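
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the page does not state either convention, so both are assumptions. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2236557, 2239598)
print(length)                     # 3042
print(protein_length_aa(length))  # 1013
```

Under these assumptions the computed values agree with the listed gene length of 3042 bp and protein length of 1013 aa.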


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.919333
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTTG ATGAACTATA TCAGGTATTT CTTCCAACCT TTACCATGGT AGAAGAAGAA 
TGGAATAATT TTTTAGCTGA AAAGAATACT GCGCTCTCAG AAGCTCAAAA CTATCTTAAT
GCTATTTTTT CTATTACAGG CTACAATTTA CCTACCTACG AAGAAATATC ATCAACGCGC
TATCTTGGAG TCTATGAAAA TGGTGCGCAA GTAACCTTTG AGGGTAGTGG TATTGATAAG
CTCTTTGGTG GCGATATGAC TACCGGTACT CTCTCACTAT CTCGCATTGC TCTTGATTCG
TATGCCAATA ACATCCACAT GGCAATGCTT GGCTACAACA ATGGCATTAC GGTTGATTTA
AACAGTGGCG CTGTTAGTGG CATGTTTAAT GAATATTCGC TTACTACGCC ATGGTTTGAG
CTATCTGCGG TTGGTACGGT TGCGGTTTCA GGTGCTTCAA TTCTTTCGGA AATGAATATT
GAGGCTGAAC TGACGGAGCT TTCTTTTACT TATCCTGATG ATGGTGTTAT TGTAAAGCTG
CTTGGTGATA TTGATTATTA TCAAGATGAG CTTGGTAATA TGGAGTATAG CGGAGATGTT
TATACTGCGT ACTTTACGGG ATGGGGTGCC GATATTACTC TGAATGGAGA TTTTCAGTGT
GATTTTGATA TTAATAATAC TTTTATATTA TTTAGTGATG AACTAACAGG GGAGCTTTAT
GTATCAGAAC TTTCTCTTGT AATTCCATCT CAACATGTTG TTGCTAATTT TTATAATTCT
TTAACAGGAT TATCGTATGA TATTACCTCT GGTTCTCTTG GTGGGAGTTT TAATTCATTC
CATTTTGGAA CTTCGTTATT TGATGTTTAT GCACATGGCT CAATTTATGT AGAGCCTGTT
AGCTCATCAT CTACTTTAGG TATTCATGGA ATATTGGATA GCATTGAAAT AACCTATCCT
GAGAGTACGT TTGCAATAAC TGTTGTTGGG GATGTAGAGT ATATCCAAAT TGATCAAGGT
GAATATACCT ATTTTGGAAC TATGTCGGAA GTATATCTGG AAAATCCTAC AACCACAGTT
TCTGTACTTG GTGATTTTTC AGGTTCATAT AATGACTCTA ATGGATTACA TCTTGCAGGA
AATTTGTATG AATTCCATTG GCAGCGGGAA GAGGCATTTA TTTCATTTGT TGGTGATATA
GTATTTGGTG AAGATCAGCT TGTTGTTAAT GAGGTTACAA CTCTTGAGGT GTATGGGGAT
GGACGATATT ACGATGCTTC ATCGCTTTCT GTAATGAATA TAGTTACTGA TGTTGTAGGT
AATGAGTTGC TTGCTCTTGG TGAAGATGCT AATTGGGATA TAAGTGCAGC GTTGGATGAA
TTGTTGTGGG ACGTTATCAA TAAATTGAAT GGTGATGCGG AAGGTGTTAG TAGTGTAAAT
TTTGATAGTG TTCCCGCAAG TAGCACACCT GTAAATGCTG AATTTCTTGA TTTCTATCTT
GATTTATCAA AAGTAGGTGA AGCGGGGTAT TATGCTTCGT TCCGTGTAGG TCATTTGTAC
GATACTAACG GCGATGGTTT ACCAGATTAC GTAGATGAAA TTCATGATAG CCCAGCTACC
ATTACATGGA ATAATGGAAT GTTTACAGTT CTTAGTCTTG ATGATAGTAG CACAAGGGCT
ACCGGTTCTT TGGCTTATGA TGGTAATGGA AACGCCGTTG GATTGTATGC CTTTGATCGT
GCTTCTGGTG ATAGCGAAAC CACTCCTCCA ACCCTCATCG CTGCTACCCC ATCGGATAAT
GCTATGGGCA TAGAGGTTGA AAGTGATCTA TCATTTATCT TTAGCGAAAA TGTACAATTT
GGTAACGGTA CTATTGAAAT TCACAGAGGT TCTGCCACGG GTGAATTTGT CGAGAGTTAC
AATATCGGCA CTCCACTCAG CACGAATCTC AACATTGTTG GTTCTACGCT TACTATTAAT
CCAACAAGCG ATTTAGCAAG CAATACCCAC TACTTTGTAA CCTTTAGTGA AGGAAGTATT
CGAGATTTAG ATGGCAATAA TTACGTTGCA TCGCAACCTT ACGACTTTAC TACAGGCGCT
GATCCTTATC CAACGCACAC CCTCACAGGC AACATCACCT TCTGGAAAAC CGGCGAAGCC
ATTACCGACG TAACAACAAC CCTTACCACC TTGCCAACCA ATGGCACACA CGCCATTGAA
CTTAAAAACA TTCACGTTCA AGCCAATGGC AGCCACACCA TTGAAGTATG GGCAACCACG
CCAAACAGCA CCACTGGTAG CTTTGAATGC GAATTTGCCT TGCCTACAGG CACAAGCGTA
ACGTGGCAGG ATGCCGCTAA ATTGCCTTCA GGCTGGATGA CAACCAACAA CGTCATCGCC
ACCGGTGCGT TCCGTGTTGC GAGTATTGGC ACCCATGCTT TAGCTGAAGG TGCAGTACAA
CTTGGCACGC TCACCATTAG CCAATCCGCA AACCCCGGCA CCTTTGAACT CGCTATGACG
CACGCCCAAC TTGGCAACAA CGATGTTGCC GGCTATGCAA TCAGCAGCGT TAGCTCCACC
ACCGGCAGCG GCAATGAATA TCAGTACCAT TCGCTTACCG ATGGGCACTA TGCGCTCACG
GGCGACAAAG CCGCAGGAGA TGCAGGCAGC GCCGTACACG CTAACGACGC ACTTGCCGCC
TTAAAAATGG CAGTGGAACT TAATCCCAAC GAAGCCAACG CAAATGGGTT GCTCGGTCCC
GTTTCACCAT TCCAATACCT TGCTGCCGAC ATCAACCGCG ACGGCAAAGT GCGTGCAAAC
GATGCCCTCA ATATTTTGAA AATGGCAGTT GGCATTGAAT CAGCACCAAC CGACGAATGG
ATTTTTGTTG CCGAATCCGT TACCGGCAAA ACCATGGATC GCAGCCATGT TGACTGGTCA
GACATCAGTC CTATTGTGGA CTTCAACCAA ACCGCCATTG AACTCGACCT CATCGGTATT
GTCAAAGGCG ATGTTGATGG CAGTTGGGTA ATGGTGGGAT AA
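
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be measured or analysed. The sketch below shows one way to do that and to compute a GC percentage. The page does not say how its GC content figure is derived; taking it as the share of G and C bases over the whole gene sequence is an assumption, and the function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed on this page.
first_line = "ATGAAACTTG ATGAACTATA TCAGGTATTT CTTCCAACCT TTACCATGGT AGAAGAAGAA"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Applied to the full listing, `len(clean_sequence(...))` can be compared against the Gene Length field, and the result of `gc_percent` against the GC content field.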
 
Protein sequence
MKLDELYQVF LPTFTMVEEE WNNFLAEKNT ALSEAQNYLN AIFSITGYNL PTYEEISSTR 
YLGVYENGAQ VTFEGSGIDK LFGGDMTTGT LSLSRIALDS YANNIHMAML GYNNGITVDL
NSGAVSGMFN EYSLTTPWFE LSAVGTVAVS GASILSEMNI EAELTELSFT YPDDGVIVKL
LGDIDYYQDE LGNMEYSGDV YTAYFTGWGA DITLNGDFQC DFDINNTFIL FSDELTGELY
VSELSLVIPS QHVVANFYNS LTGLSYDITS GSLGGSFNSF HFGTSLFDVY AHGSIYVEPV
SSSSTLGIHG ILDSIEITYP ESTFAITVVG DVEYIQIDQG EYTYFGTMSE VYLENPTTTV
SVLGDFSGSY NDSNGLHLAG NLYEFHWQRE EAFISFVGDI VFGEDQLVVN EVTTLEVYGD
GRYYDASSLS VMNIVTDVVG NELLALGEDA NWDISAALDE LLWDVINKLN GDAEGVSSVN
FDSVPASSTP VNAEFLDFYL DLSKVGEAGY YASFRVGHLY DTNGDGLPDY VDEIHDSPAT
ITWNNGMFTV LSLDDSSTRA TGSLAYDGNG NAVGLYAFDR ASGDSETTPP TLIAATPSDN
AMGIEVESDL SFIFSENVQF GNGTIEIHRG SATGEFVESY NIGTPLSTNL NIVGSTLTIN
PTSDLASNTH YFVTFSEGSI RDLDGNNYVA SQPYDFTTGA DPYPTHTLTG NITFWKTGEA
ITDVTTTLTT LPTNGTHAIE LKNIHVQANG SHTIEVWATT PNSTTGSFEC EFALPTGTSV
TWQDAAKLPS GWMTTNNVIA TGAFRVASIG THALAEGAVQ LGTLTISQSA NPGTFELAMT
HAQLGNNDVA GYAISSVSST TGSGNEYQYH SLTDGHYALT GDKAAGDAGS AVHANDALAA
LKMAVELNPN EANANGLLGP VSPFQYLAAD INRDGKVRAN DALNILKMAV GIESAPTDEW
IFVAESVTGK TMDRSHVDWS DISPIVDFNQ TAIELDLIGI VKGDVDGSWV MVG