Gene Cag_1055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1055 
Symbol 
ID3747037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1431759 
End bp1436114 
Gene Length4356 bp 
Protein Length1451 aa 
Translation table11 
GC content44% 
IMG OID637773585 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_379360 
Protein GI78189022 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0430168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGGA TATTTAACGT GATCTGGTCG GTTACCAGAG AAAAGTGGGT TGTGGTTTCA 
GAAAAAGTTA AATCAAATGG TTCCGTACCA AAATCATCAT TAGTAAGCAT TGCTTTTCTT
TCGGCATTGC TTGGTGGTGG CAGTGTAGCT CAAGCCGTTG AGCCCGGGCA GTTACCAACG
GGCGGCGTTA TTACAGCAGG TAGTGGTTCC ATTGCTACAA ACGGCAACAG CATGACTATT
CAGCAGTCAA GCCAAAAAAT GGTGGCTAAT TGGAACAACT TTAATGTTGG CAGCGATGCA
AGTGTGCGCT TTCAGCAGCC AAACGCTTCT GCCGCTGCAC TCAACCGTAT TGCCGGGCAA
AATCCGTCAC AAATTCTTGG CTCACTTTCT GCTAACGGAC GCGTTTTTCT TATCAATCCA
TCAGGTATTG TATTTGGACA GAATGCTCGT GTTGATGTCG GTGGATTGGT TGCTTCAACA
CTTGATATTT CCGATTACGA TTTTCTTGCT GGTAACTTCG CTTTTCGTTC AACGGGATCG
GCTGGCACCT TGCGCAATGA AGGATTAATT AACGCTATGC CGGGTGGCGT GGTGGCACTC
TTAAGCCCTT CAGTTATCAA TAACGGCACC ATTACTGCGG TGGGCGGCAG TGTAGCGCTT
GCAGCGGGAA ACCAAATGAC GCTTGACTTT GGCGGCGATG GCTTAATGAC TGTTCGAGTT
GATGACGGTG CGGTCAATGC GTTTGTGGAG AACAATTCGC TTATTAAAGC TGATGGCGGA
TTAGTTGTAA TGAGCGCAAA AGCCGCTAAC AATCTTGCTT TTTCTGCAGT TAACAACAAC
GGCGTCGTGC AAGCAATGAG CGTTGTTGAA AAAAATGGAC GCATTTTGCT TGATGCCGAA
GGTGGGCAAA GCACGGTTTC TGGTACGCTT AATGCTTCAT CAGTTGATGG TAAAGGTGGT
CAGGTTGTGG TTACAGGCAA GCAGGTAATG ATTGCCGATG GTGCTCACCT TAATGCATCA
GGTCTTACGG GTGGTGGCGA CGTGTTGGTT GGTGGTAGTT GGCAAGGTAG CGATGCCTCC
GTTCGTCAAG CCGTTGGCAC GGTGGTAATG CCTAATACGC TTTTGCAAGC TAATGCAATT
AGTAATGGCA ATGGTGGTAC AGTGGTTGTA TGGTCGGATG TTAACAATCC GCTTTCAGTT
ACTCGCGCTT ACGGCACGTT TGAAGCCTTT GGTGGAACAA ATGGCGGCAA TGGTGGACGT
ATTGAAACTT CTGGTCATTG GCTTGATGTT GCAGGTTCAC GCGGTGGCGC TTCGGCGGTA
AATGGCAATG CGGGTGTGTG GCTGCTTGAT CCGTATAACG TAACGATTTC TTCATCTAAT
GCTAATGGTT CTTGGGGTGG TGTTTTTCCC AATGCTATTT GGACTGCAAG CGGAGATAAC
TCGAATTTAC TTGCTTCTGA TATTACAACC CGACTTAATG CTGGCACGAG TGTTACGGTT
CAAACAGGCA CGGCAGGAAG TCAGGCTGGT GATATTACGG TTGATGGAGC TATCAACATG
ACCAATGATA GTGGGGAGGT GTCATTGCAA TTAGATGCAG CAGGAAGTAT TGCTATTAAC
AATAATATCA CCAACTCTAC AGGTACACTT CATCTTGTGT TTAACAGTGG AACAGGTGCT
ATTAGTGGAA CAGGTGCCTT AGGTAGTGGT CAGGGAAGAA CTCTTTTTAA TGTTGGTGCT
TCTACTGGTA CTTTTTCTGG AATAATTAGT GGTGCTAGTC GTACTGTTAC AAAGCAAGGA
GCTGGCACGC TAATATTTTC TGGAGCTAAT ACTTATGGTG GTTTAACTTC TATTGAGGCT
GGAGTATTAA GAGTAGCAAA TGCTCAAGGA TTAGGTGATG TAACAAATGG TACTCAAGTT
TCTAATAATG GAGCATTGGA ACTTTCTGGG GGAATAGTAA TAACCGGAGA TGAGGTTCTT
CGTTTAGTAG GTACAGGGGT TAGCAATAGT GGCGCATTAC ATAGTATAGG TAATAATAGT
TTTGGAGGAC ATATTATTCT TACCGGAAAT AGCACTATTA CTTCCGATAC CAATGGAACG
TTAATCTTAG GCAATGCCAG TCAAGGTATT TACGGTGCTT ACGGATTAAC TCTTTCAGGT
GGCGGCAGTG TTGTATTTAA TGGGGCTATT GGCGCTACGA TACCACTCGC CTCATTTCAT
GGATTAACAG GAACGTCTAT TGAGCTTAAT GGTGGTTCAA TTACAACGAC CGGCGTAATT
AGTGCTCTTG GTCAAGTTAA AGCTACTAAT CCATTAACGT TATCGTCTGG TATTAGTGAT
ATTTCGTTAT CAAATGAAAC TAATGACTTT ACAACGGTAA CGGTAACAAA TGCTGGTGCA
GTATCGCTCA TTGATGATAC TGCATTGACG TTAGCTGGTG TTAATGCAAG TGGTGATGTG
AATATTGCAA CCCATACTGG TAATTTGACG GTTACGGGTA ATGTTGCAAC AACAAGTGCA
ACACCAACCG CGTTAACCTT AAATGCTGAT CAAAGCAAAG ATGCTGGTAA TGGCAATGGA
GAAAATCTCA TTCTTTCAAG TGGTACTCTT ACTGTTGGTT CGGGTGGTAT TGCTAAACTT
TATACTGGCA GTGTAGCTGG TAGCACATCA ATTGCTTCAG TTGTTAATGC AGGTCATTTC
CGCTACAATA GTGATGAAGC GGTACAACAT TACACTGATC CATTAACTGC TGGTTTAAAC
CTCATTTATC GTGAGCAACC AACGCTTTCA GTTATGTTTG CTCCTGTAAC TACGACGTAT
GGTACAACTC CAACATTTGC GATAAGTTCG TATAGCGGTT ATATAAATGG AGATACTTCC
CCAGGAATTG TTACTGGTAC GCCAACATGG TTGGTAGATG GAACGCCTTC TTTTGCAGGA
TATTATACTG CTGGTACTCA CAATGTTTCA TACAATAATG GACTTATCAG TAGTCTTGGT
TATGGTTTTG TTGATAATGC AATTAGTTTT AATGACTTAG TTGTTAATCC ACTGGTGTTA
GCAGCGACAT CTTTAACGGG TTTAACGGCA TCAGATAAAA TATACGATGG TCAAATAACT
GCTACTATAA GTAATTATGG TACGCTGACA GGAATACTGA CGGGAGATCG TGTTGCATTA
AATAGTGCTG GATCAAGTGC AGCTTTTGCA GATAAAAACG TCGGTACCGG TAAGACGGTA
ACGGTAAGTG GTTTAACGTT GTCTGGACTT GATAATGGTA ACTACCGTAT AGTTCCACAA
ACAACGACAG CATCTATTAC CCAAAAATCG CTAAATGTTA CAGCTCCAAG CAACGTAACC
AAAGTGTATG ATGGCACGGT AGCGGCTCCA GGTGTTGCTA CCGTCACTGG TCTTGCCATC
GGTGATGTTG TAGCGGGAAC AGCAACTATT GAATATGCCG ATAAAATGGC TGGCAGTAAT
AAGGTTGTTA ATCCGTTAAG TGTAACGATT CTCGATGGGT TCGATATGAT TATGACTAAC
AACTATGCAA TTACTTATGT TGGCGATCAT GGTACTATTA CCCAAGCACC ATTAACGCTT
ACAGCGCCCG ACAACGTTAC GAAGTATTAT GATGGATTAC TTACTGTTCC AGGTACTCCA
AGTGTTAACG GTTTAGTCCC TAACGATGTG GTGGTTATAC CGGCATCTCT GCTTTATACT
GATCCTGAAG TGGGAATAGG TAAAACTGTT AATCCTGATT CAGCAGGCTT GGTGATTCAT
GATGCTATAG GTAATAATAT GACTCCAAAC TATGCCATTA CTGATATAGC AAGCCACACT
GGTATTATTG TTGAAAAAAC CTTTACACCA TTTAAAAAAT GGAATGATGC TGATCCTTCG
GTGCCCGAAA TACCCACTAA TGCCCCTGAG GTAACCGGTT CTCGCGATTT AGCGGGAAGC
GATTTTGAGC CGGCAACCGA TAGTGGAGTA ACAGCTACTC GCTCACTCAC AATGGCTACC
ATGGATGAAA GTGCAGTGCA ATCTGATATT GTGGTGAAGC TTGCGGAGCC TGCATCTAAA
AATAAGCAAG GTGTGGTTAA GGTTTTTGTA CCAAAAGAGG TGTTTGCAAA GCCTGCTTTC
TTGTTCCCAT TACCTGAGGA GGTAGCTGTT GAGATAAATA AAACTAACGT GCAGGAGAAG
GTTTTCATGC AAAATGGTGA TGCGCTGCCG GGTTGGTTAA GTTATGACTA TGAGAAAAAA
ATCTTTACAG CAACAAGCGC TCCTGCTGGT TCGTTACCGC TTACGATTAT GGTTCAATCG
GGGACGATGG CTTGGCAAGT GATTATCCAA CAGTAG
 
Protein sequence
MNRIFNVIWS VTREKWVVVS EKVKSNGSVP KSSLVSIAFL SALLGGGSVA QAVEPGQLPT 
GGVITAGSGS IATNGNSMTI QQSSQKMVAN WNNFNVGSDA SVRFQQPNAS AAALNRIAGQ
NPSQILGSLS ANGRVFLINP SGIVFGQNAR VDVGGLVAST LDISDYDFLA GNFAFRSTGS
AGTLRNEGLI NAMPGGVVAL LSPSVINNGT ITAVGGSVAL AAGNQMTLDF GGDGLMTVRV
DDGAVNAFVE NNSLIKADGG LVVMSAKAAN NLAFSAVNNN GVVQAMSVVE KNGRILLDAE
GGQSTVSGTL NASSVDGKGG QVVVTGKQVM IADGAHLNAS GLTGGGDVLV GGSWQGSDAS
VRQAVGTVVM PNTLLQANAI SNGNGGTVVV WSDVNNPLSV TRAYGTFEAF GGTNGGNGGR
IETSGHWLDV AGSRGGASAV NGNAGVWLLD PYNVTISSSN ANGSWGGVFP NAIWTASGDN
SNLLASDITT RLNAGTSVTV QTGTAGSQAG DITVDGAINM TNDSGEVSLQ LDAAGSIAIN
NNITNSTGTL HLVFNSGTGA ISGTGALGSG QGRTLFNVGA STGTFSGIIS GASRTVTKQG
AGTLIFSGAN TYGGLTSIEA GVLRVANAQG LGDVTNGTQV SNNGALELSG GIVITGDEVL
RLVGTGVSNS GALHSIGNNS FGGHIILTGN STITSDTNGT LILGNASQGI YGAYGLTLSG
GGSVVFNGAI GATIPLASFH GLTGTSIELN GGSITTTGVI SALGQVKATN PLTLSSGISD
ISLSNETNDF TTVTVTNAGA VSLIDDTALT LAGVNASGDV NIATHTGNLT VTGNVATTSA
TPTALTLNAD QSKDAGNGNG ENLILSSGTL TVGSGGIAKL YTGSVAGSTS IASVVNAGHF
RYNSDEAVQH YTDPLTAGLN LIYREQPTLS VMFAPVTTTY GTTPTFAISS YSGYINGDTS
PGIVTGTPTW LVDGTPSFAG YYTAGTHNVS YNNGLISSLG YGFVDNAISF NDLVVNPLVL
AATSLTGLTA SDKIYDGQIT ATISNYGTLT GILTGDRVAL NSAGSSAAFA DKNVGTGKTV
TVSGLTLSGL DNGNYRIVPQ TTTASITQKS LNVTAPSNVT KVYDGTVAAP GVATVTGLAI
GDVVAGTATI EYADKMAGSN KVVNPLSVTI LDGFDMIMTN NYAITYVGDH GTITQAPLTL
TAPDNVTKYY DGLLTVPGTP SVNGLVPNDV VVIPASLLYT DPEVGIGKTV NPDSAGLVIH
DAIGNNMTPN YAITDIASHT GIIVEKTFTP FKKWNDADPS VPEIPTNAPE VTGSRDLAGS
DFEPATDSGV TATRSLTMAT MDESAVQSDI VVKLAEPASK NKQGVVKVFV PKEVFAKPAF
LFPLPEEVAV EINKTNVQEK VFMQNGDALP GWLSYDYEKK IFTATSAPAG SLPLTIMVQS
GTMAWQVIIQ Q