Gene Cag_1481 details

Gene Information

Locus tag: Cag_1481
Symbol:
ID: 3747972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1947129
End bp: 1950488
Gene Length: 3360 bp
Protein Length: 1119 aa
Translation table: 11
GC content: 39%
IMG OID: 637774016
Product: glycosyltransferase-like protein
Protein accession: YP_379780
Protein GI: 78189442
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
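
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (an assumption).
    return gene_len_bp // 3 - 1


length = gene_length_bp(1947129, 1950488)
print(length)                     # 3360
print(protein_length_aa(length))  # 1119
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.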


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAAT CTATAAATAT TTTTACTCCA GATAAAATTA CATTACCATT ATCTTGGGTT 
GGACATATAC CTTTTGCTGC TTGGCTTGTA GAAATACTTC ATCCTCAAAT ACTCGTAGAA
TTGGGTACGC ATTCAGGCAA TTTATATTTC GCTTTTTGCC AGACTGTCAA GAAAAATCAA
CTTTTAACAA AATGCTATAC AGTGGATACA TGGAAGGAGG CTCAGCACTC CGGCATCTAT
GACAACAATG TCTATAATGA AATATTGGCA TACAATAGCA CAATTTACGG TGATTTCTCT
CAGTTGTTTT GTATGACTTT TGACGAAGCG CTGACAAAAT TTGAAAATGG TTCGATTGAC
TTACTGCACA TAAACGGAGA GTACACCTAT CAGGCTGCCC GAAAAGATTT CGAAACATGG
CTCCCCAAAA TGTCTAATAA AGGCATCATT CTCTTTCATG ACATCATGGT TAAAGAAAGA
GAGTTTGGCG TGTACAGGTT ATGGGAAGAA CTTTCCTCCC AATATGGGCA TTTCGAATTT
ACCCACTCTC ATGGGCTTGG CGTTCTTCTG ACAGGGAAAA ACCAGCACCC TGCTATTGAA
AGTATGGCTC AAGATTTTCA GGATACCCAA AAAAAGAAGC TTATCAGTGG ATACTTCGAA
CATAGTGGAT ACGCAATAGA GCTGGAGTAT CAACATCAAT CAGATGCAAC TAAAATTCAT
AAGCTTAGCC AGCAACTAAA AAGTCAGTCA TTACAGATCA ATTCATTACA AAAGCATAAT
CAAAGTTTGC AACATGAAAT ACAGGAATTG AACAAATCGC TTGTCCGAAT TACCAACTCC
AGTAGTTGGC GCTTAACAAA GCCTATCAGA AAATGGAGTA AATCTTTGCG AAAAAGATTT
AGAAAAATTC GATATTTCTT AACGGGGGAA ACAGCTGAGA ATGTACCAAA AAGATTAACA
ACGCTCTGCA ATAACTGGTT CAAACCAACC GATAAGACTC GCATATTAAT CATCGACTCA
TGGATACCAG CACCTGACCA GGATTCTGGC TCTATGGATA CATTTCTGAC AATGAAAGCC
CTTGTGGAAC TCGGATATGA CATCACGTTC ATACCAAAAG ATCTGAAGGC AAAACAGAAA
TATGTACAGT TACTTGAAAA CGAGGGAGTA AGATATCCAG ATCTAAGCAA AGCGGCCATC
TCTATTGAAG AGTTTCTCAA GGTTGCAGGG CATTATTTTG ACCTTGTTAT GCTGTACAGG
GTCGATACAG CCTCCTCATT TCTTGCCATG GTAAAGCACT ATGCTCCACA GGCAAAAATT
GTATTCAACA CTGTTGACCT TCACTTTCTG AGAGAACAAA GAAACGCTGA GCTTTCTGGA
TCTGACATAA TGCGTAAAAA TGCATTGAAG ACAAAAGAGC ACGAGTTGCA ACTCATGCAG
CAAGCAGATA GCACAATAGT ACTTAGTAAT GTAGAATTTG ATTTAGTCAA AAAAATCAAA
CCTGAGGTAA ATCTTGAACT CATGCCTTTT TTCAGAATGA TTCCAGGAAG GTCAGCAGCG
TTTCATGAAA GAAAAAATAT TGTATTCATT GGTGGCTTCA AACACCAGCC GAATCTTGAT
GCCATTACGT ATTTCATTTC AGAAATATGG CCTAAGGTAC ACCTTAAGCT ACATGATGCC
AAACTACGTA TAATAGGTAG TAATCCACCT AAAGAGCTCT ATCGACTGGT AGATTCTGAC
AACACTATTG AGCTTTTGGG ATATGTCGCA AATCTTGATC CTGAATTTAA TACATGTAAA
CTTACAGTTG CACCTTTACG GTCAGGAGCA GGGATAAAAG GAAAAATTGT CACAAGCTTG
AGCTACGGTG TACCATGTGT TGCAAGCCCT ATTGCTTCAG AAGGTATGGA GTTAATTCCT
GATAAAGACC TACTAGTTGC CAAAGAGCCC GATGAGTTTG CCAATAAGAT CATAAAACTC
TACACAGATG AGGCATTGTG GAACGCCCTA AGTGATAATG CGCTCACTAC AGTTGAAGAG
CGATATTCCT ATAAAGCAGG GAAAAAGAGA ATTGGGGATT TTCTCAATAA ACTACTCGGC
AGCTCTCGTC ACTCAGTTTG GGGTTCTGAA GAATTTTTGC AAAACAATCT GGCAAATGAA
ACGGACGATA CAGATGGGAA AAAACGTATC ATCATTGAAC TACCTTCTTT TGACAAAGGT
GGCCTTGAAA AGGTAGTATT AGATTCCATT TTAGCATTCA ATAAAAACAA GTTTCATTTC
CTTATTGTCA CACCCGGGAA ACTTGGTGAA TTAAGTACTG TCGCAACAAA CGCTGGATTA
TCTGTCATTC AATTACCTGA CATTAACCAC GAAGCAGCAT ATGAACGGTT GGTTATAAAA
TACCGTCCTC ATGCATCAAT GTCGCATTTT TCTCATTTGG GATATCCGGT TTTCATAAAC
CATCATATTC CAAATATTAC ATTCATTCAC AATGTTTATG CCTTTCTTAG TGAGAAGCAC
AAAAAAGAAA TCATGATGTA TGACCATGCA GTCACTCGTT ATATAGCTGT TTCGCCTAAA
GTGGCCTGCT ATGCAGAAAA GAATCTTGGT ATAAACCAAG AAAAAATTAC GATCATTCCA
AACGGATTGT GCATCACGGA ACATGAAGAA CGGCAGAAGA GAGCAACTCC AGCATTAAGA
GATGATTTTG GCCTAAACAA AAATGATTTT GTTTTTCTGA ATCCAGCTTC GTACAATTTA
CACAAAGGGC ACTATATCAT GGTTGATGCC CTTCAGATAG TGACAAAAAA AAGAAAGGAC
CTCAAAATCC TGTGCGTTGG CAATATCGTC CATGAACCAC ATTACCATGA ACTTCAGCAA
TACATCATAT CATGTGGTCT TTCAGAGCAT ATGCTAATGC TAGGTTATAT ATCGAAGATT
GAAAATATCA TGCCCATTGT TGATGCCTGC ATTATGCCAT CCTTTATTGA AGGTTGGAGC
ATTGCCATGA ACGAAGCTAT GTTTTACGGC AAGCCCCTCA TTATGACTGA TACAGGCGGA
GCATCTGAAG TCATCGAAAA TAATGATATC GGCATACTCA TTCCGAATGA ATATGGTGCC
TCGGATTTAT TGGATCGCAC CACTCTTGAC AAACTTGCCT ACAAACCGCA TCATTACAAA
ATATCAAGCA TGGTGGCTGA TGCGATGATA GCGTTTGCTG ATAACCATGA ATACTGGAAA
AAGGCCGGAG AAAAAGGCCG GAAAAAGATT TATCGCCATT ATGCATTTAA GAATGTGGTG
GCACAATATG AAGAAATCAT GAATCAAGTA ACTGAACCAG TAACTTACGA ACCACAATGA
 
Protein sequence
MQKSINIFTP DKITLPLSWV GHIPFAAWLV EILHPQILVE LGTHSGNLYF AFCQTVKKNQ 
LLTKCYTVDT WKEAQHSGIY DNNVYNEILA YNSTIYGDFS QLFCMTFDEA LTKFENGSID
LLHINGEYTY QAARKDFETW LPKMSNKGII LFHDIMVKER EFGVYRLWEE LSSQYGHFEF
THSHGLGVLL TGKNQHPAIE SMAQDFQDTQ KKKLISGYFE HSGYAIELEY QHQSDATKIH
KLSQQLKSQS LQINSLQKHN QSLQHEIQEL NKSLVRITNS SSWRLTKPIR KWSKSLRKRF
RKIRYFLTGE TAENVPKRLT TLCNNWFKPT DKTRILIIDS WIPAPDQDSG SMDTFLTMKA
LVELGYDITF IPKDLKAKQK YVQLLENEGV RYPDLSKAAI SIEEFLKVAG HYFDLVMLYR
VDTASSFLAM VKHYAPQAKI VFNTVDLHFL REQRNAELSG SDIMRKNALK TKEHELQLMQ
QADSTIVLSN VEFDLVKKIK PEVNLELMPF FRMIPGRSAA FHERKNIVFI GGFKHQPNLD
AITYFISEIW PKVHLKLHDA KLRIIGSNPP KELYRLVDSD NTIELLGYVA NLDPEFNTCK
LTVAPLRSGA GIKGKIVTSL SYGVPCVASP IASEGMELIP DKDLLVAKEP DEFANKIIKL
YTDEALWNAL SDNALTTVEE RYSYKAGKKR IGDFLNKLLG SSRHSVWGSE EFLQNNLANE
TDDTDGKKRI IIELPSFDKG GLEKVVLDSI LAFNKNKFHF LIVTPGKLGE LSTVATNAGL
SVIQLPDINH EAAYERLVIK YRPHASMSHF SHLGYPVFIN HHIPNITFIH NVYAFLSEKH
KKEIMMYDHA VTRYIAVSPK VACYAEKNLG INQEKITIIP NGLCITEHEE RQKRATPALR
DDFGLNKNDF VFLNPASYNL HKGHYIMVDA LQIVTKKRKD LKILCVGNIV HEPHYHELQQ
YIISCGLSEH MLMLGYISKI ENIMPIVDAC IMPSFIEGWS IAMNEAMFYG KPLIMTDTGG
ASEVIENNDI GILIPNEYGA SDLLDRTTLD KLAYKPHHYK ISSMVADAMI AFADNHEYWK
KAGEKGRKKI YRHYAFKNVV AQYEEIMNQV TEPVTYEPQ
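
As a rough consistency check between the two sequences above, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is illustrative only: the helper names are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). It is shown on the first and last 30 bases of the gene sequence rather than the full 3360 bp.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Table 11 shares
# these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100 * sum(base in "GC" for base in dna) / len(dna)


first_30 = "ATGCAAAAAT CTATAAATAT TTTTACTCCA"
last_30 = "ACTGAACCAG TAACTTACGA ACCACAATGA"
print(translate(first_30))   # MQKSINIFTP
print(translate(last_30))    # TEPVTYEPQ
print(gc_percent(first_30))  # 20.0
```

The two translated fragments match the start and end of the protein sequence. The GC value printed here covers only the first 30 bases, so it is not expected to match the 39% reported for the whole gene.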