Gene Cag_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1053 
Symbol 
ID3747034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1425092 
End bp1429435 
Gene Length4344 bp 
Protein Length1447 aa 
Translation table11 
GC content44% 
IMG OID637773582 
Productfilamentous haemagglutinin-like protein 
Protein accessionYP_379358 
Protein GI78189020 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0559145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGGG TATTTAATGT GATCTGGTCA ATTACCAGAG AAAAATGGGT TGTGGTTTCA 
GAAAGAGTTA AATCAAATGG CTCGGTGCCA AAATCATCAT TAGTGAGCAT TGCTTTTCTT
TCGGCATTGC TTGGTGGAGG CAGTGTTGCG CAAGCGGTAG ATGCCAACCA GTTACCAACG
GGCGGTGTTA TTGCCGCTGG TAGTGGCTCT ATTGCGGCAA GCGGCAACAG CATGACGATT
CAGCAGTCAA GCCAAAAGAT GGTTGCTAAT TGGAGCAGCT TTAACGTTGG TAGCGATGCA
AGTGTGCGTT TTCAGCAACC AAATGCTTCA GCCGCAGCAC TGAACCGTAT TGCAGGACAA
AGTCCTTCAC AAATTCTTGG TTCACTTTCG GCTAACGGGC GTGTTTTTCT TGTCAACCCA
TCAGGTATTG TGTTTGGCAA AAATGCTCGT GTTGATGTTG GTGGATTGGT GGCTTCAACC
CTCAATATTT CCGATAACGA TTTTCTTGCA GGGAACTACG CTTTCCGCTC AACGGGATCG
GCAGGAACGT TACGCAATGA AGGTGTGATT AACGCAATGC CGAATGGCGT GGTGGCACTT
TTAAGTCCAT CAGTTGTTAA TAATGGCACT ATTAATGCAG CAGGTGGCAC GGTAGCACTT
GCCGCAGGTA ACGCTATGAC GCTTGATTTT GGCGGTGATG GCTTAATGAC GGTTCGCGTG
GACGAAGGTG CCGTGAATGC GTTGGTTGAA AATAATGCGC TCATTAAAGC AGATGGTGGG
CTTGTTGTAA TGAGTGCAAA AGCGGCGGAT GAGTTAGCAC TTTCAGCAGT AAACAGCAGC
GGTGTGGTGC AAGCAATGAG CGTTGTTGAA AAAAATGGAC GCATTTTGCT TGATGCCGAA
GGTGGGCAAA GCACCATTTC GGGCACACTT GATGCCTCGT CAGTTGATGG CAAGGGTGGT
CAGGTTGTGG TTACAGGCAA GCAAGTAATG GTTGCCGATG GCGCTCATTT AAACGCCTCA
GGTCTCACTG GTGGTGGCGA AGTGCTTGTT GGTGGTAGTT GGCAAGGTAG CGATGCCTCC
GTTCGTCAAG CTGTTGGCAC GGTGGTTATG CCCGGAGCTT TGTTACAAGC AAATGCAACA
GGCAACGGCA ACGGTGGCAC GGTGGTTGTT TGGTCAGATG TAAATAATCC ACTCTCTGTT
ACTCGTGCTT ACGGTACCTT TGAAGCGTAT GGTGGATTGT TAGGCGGTAA TGGTGGACGT
ATTGAAACTT CGGGACATTG GCTTGATGTG GCAGGCTCAC GCGGAGGAGC ATCGGCAGTA
AATGGCAATG CGGGTGTGTG GCTGTTTGAT CCTTGGAATG TGATTATTGG TCCAGATCCA
ACAACGAGTG GAACATCGTT TACTAATCCA TTTAATCCCA CTGGAGATTC AACGATTCTT
GCATCGAACA TTAATACTTT GCTCAATGCA GGAACAAGTG TGTCCATTAC TACGGGTACG
GGAGGTACAG TTGGGGTAGG AGATATTTCG GTTAATGCTC CTATTTTAAA AACTACAGTA
ACAGGTCTTA ATACTTTGAC GTTAAGTTTA ATTGCTGAAG GAAATATTTT TATCAATAAT
TCCATTGGTA ATTCTTCGGG TACTCTCAAT CTCAATTTAA CAACGGTAAA TGGTGCAATT
AGTGGCACAG GAAATATTAC CGGTAATGGT AATGGAGATA CAATTTTTAC TGTTGGTGCT
GGAAGTGGTA CCTATAGCGG AAACCTTGTT GATCGTCGTT TTGTTGAAAA GAAAGGAGTA
GGAACCTTGA TTGTGTCTGG TGATAATAAT CATGATGGTG AAACAAGAAT TTCTGCAGGA
ACATTGGTGG TTCAAAGCTC GACCGCTTTA GGTAAAACAA CAAATGGCAC TCAAGTGGTT
GATGGAGCAA CTTTGCAATT AGAAGCCAAT ATTGCAGCAC AAGAATTACT TTATCTTGCA
GGTGATGGGG TTAATTCAAA TGGTGCTTTA AAAAATATTG GGGGAAATCA TGTCTATGGT
GGAGATATTA TTTTACTTAA CAATAGTAGG ATAATGTCTG ATGCTAATAC ATTGACCTTA
AATGGTTCTG TTAATGGAGC ATATTCTCTT ACCGTAAATA GTGTCGGTAG TACAATCTTT
AATGGATTAA TTGGTAATTC AGCTCCTCTT GGTGCATTTA TAGGTACTGC TGGTACGCCA
ATTACTTTTA ATGGCAGTTC CATTACAACA GTAGGTGCAA TAAATGCTGC TGGAGTGGTT
ACAGCTTCTA ACCCATTAAC TATATCGGCA GGTGCTGGTA ATATCTCATT ATCAAATACG
GGCAATAATT TTAACTCGGT TAACATAACA AGCGCAGGCA CTGTCTCATT AGTTGATACT
AATGCTTTGG CGCTTACAGG TGTAAATGCA ACCGGAGATG TTTCAATTGC CACAAGAAGT
GGTGATTTAA CTATTGATGG TCATCTGTTA ACAACGAGTC CAACATCATC AGCAATGATC
CTTAATGCTG AACAAGCACA AATTGCAGGT AATGGTAATG GAGGTAATCT CGTGTTTTCA
AGCGGTACCC TTACTGTTGG TTCGGGTGGT ATAGCCACTC TTTATACTGG CAGTGTAGCT
GGTAGCACAT CAATTGCTTC AGTTGTTAAT GCAGGTCATT TCCGCTATAA CAGTGATGAA
GCAATAAATG GCACGCATTA CACTGATCCA TTAACTGCTG GTTTAAACCT CATTTATCGT
GAGCAACCAA CGCTTCTTGT GGCTCCTGCT GCAACACCCA CACCCTATGG AACAGCTCCA
TCTTACACAC CATCCTATTC GGGAGCTGTT AATAATGATC CTACTGTTGG CACGGTTGCA
GGTACGCCAC AATGGGCATT TGATAATGCA ACAATACCAA CAAAATCCTT ATCTGGTCAA
GATGAAGTTG GTACGTATAA CGTAAAATAC GTTGGAGGTT TAACGAGTAC GCTTGGTTAT
GGATTTGCTG ACAATGGAGG AAATGGGGAA TTAACTATAG CTCCAAAAGA AATTGTTTTT
GGTAATGGTT TAACGGGTGG TGTAAATAAT AAAGTATATG ATGGAACACT TACAGGTACT
ATAACTCCAC TTGTGCTTTA TGTTGTTGCT GGTGATAATG TTAGCTTAAA TAGTACTGGT
GCCACCGCTA CGTTTTCCAA TAAAAATGTT GGTGTAGGTA AAACTGTTAC CGTTGCCGGA
TTAGCGCTTA CGGGGGATGA TGCAGGTAAC TACTCTATTG GTAACCAAAC AACAACAGCA
AATATTATCC AAGCCTCATT AACCGTTACT GCTCCTGGTA ATCTTACCAA AGTATATGAC
GGTACGGTTA CAGCTATAGG TGTTGCAACA GTAACAGGGC TTGTTTCTGG TGATACAGTT
GCAGGAACGG TAGCTATTGC TTATGCCGAT AAAATGGCAG GCTCAAGCAA GGCTGTGAAT
CCGTTGAGTG TAATGATTGT AGATGGTTCT GATATGAATA TGACCGGCAA CTATAACATT
GCCTACGTTC CGACTGTCAA CAATACCATC ACTCAAGCGT CGTTAACGCT TACATCGCCT
GATAATGTTT CAAAGTTTTA TGATGGTTTG ATGAGTGCTC CGGGTGCACC TATGGTTACT
GGTTTAGTGC CGAATGATGT GGTAGTTACA CCGGCACCAC TCTCTTATAA TGATCCTGAA
GTGGGAAACA ATAAAACCGT TTCACCAAAT CCTGCTGGAT TGGTTATACA CGATGCAAAT
GGTGGCGATA TGACTCCAAA CTATGTTATT ACGACAATTC CGCGTAATGA TGGAGTTATT
GTCGAAAAAA CCTTCACTCC ATATAAAGAA TGGAATGATA TTGATCCATC AACACCAGAA
GTTCCAACAG CCGCACCTGA AGTGAGTGGC AACCGTGATT TGGGGGATGT TGAGCTTGCT
GCGGATGATG GAGGCACAAC AGCTACTCGC TCGCTTGCCA TGGTAGCAAT GGATGAAACA
GCTATTCAGT CAGATATTGT GGTTACGCTT TTGGAGCCTG CGGCTAAGAA TAAGCAAGGT
GTGGTGAAGG TGTTTGTGCC AAAAGAGGTG CTTGCAAAGC CCGCTTTCTT GTTCCCACTG
CCTGACGATG TGGCAACTGC AATTAATCAA ACTGCCGTAC AGGAAAGGGT TTTCTTGCAA
AATGGTGATG CCTTACCTGG CTGGTTAAGC TATGACCGTG ATAAAAAAAT CTTTACCGCC
AAAAGCGCTC CAGCAGGTTC GTTACCGCTG ACGGTGATGG TTCAAGCAGG CAGTATGGCT
TGGCAGGTTA TTATTCAGCA GTAA
 
Protein sequence
MNRVFNVIWS ITREKWVVVS ERVKSNGSVP KSSLVSIAFL SALLGGGSVA QAVDANQLPT 
GGVIAAGSGS IAASGNSMTI QQSSQKMVAN WSSFNVGSDA SVRFQQPNAS AAALNRIAGQ
SPSQILGSLS ANGRVFLVNP SGIVFGKNAR VDVGGLVAST LNISDNDFLA GNYAFRSTGS
AGTLRNEGVI NAMPNGVVAL LSPSVVNNGT INAAGGTVAL AAGNAMTLDF GGDGLMTVRV
DEGAVNALVE NNALIKADGG LVVMSAKAAD ELALSAVNSS GVVQAMSVVE KNGRILLDAE
GGQSTISGTL DASSVDGKGG QVVVTGKQVM VADGAHLNAS GLTGGGEVLV GGSWQGSDAS
VRQAVGTVVM PGALLQANAT GNGNGGTVVV WSDVNNPLSV TRAYGTFEAY GGLLGGNGGR
IETSGHWLDV AGSRGGASAV NGNAGVWLFD PWNVIIGPDP TTSGTSFTNP FNPTGDSTIL
ASNINTLLNA GTSVSITTGT GGTVGVGDIS VNAPILKTTV TGLNTLTLSL IAEGNIFINN
SIGNSSGTLN LNLTTVNGAI SGTGNITGNG NGDTIFTVGA GSGTYSGNLV DRRFVEKKGV
GTLIVSGDNN HDGETRISAG TLVVQSSTAL GKTTNGTQVV DGATLQLEAN IAAQELLYLA
GDGVNSNGAL KNIGGNHVYG GDIILLNNSR IMSDANTLTL NGSVNGAYSL TVNSVGSTIF
NGLIGNSAPL GAFIGTAGTP ITFNGSSITT VGAINAAGVV TASNPLTISA GAGNISLSNT
GNNFNSVNIT SAGTVSLVDT NALALTGVNA TGDVSIATRS GDLTIDGHLL TTSPTSSAMI
LNAEQAQIAG NGNGGNLVFS SGTLTVGSGG IATLYTGSVA GSTSIASVVN AGHFRYNSDE
AINGTHYTDP LTAGLNLIYR EQPTLLVAPA ATPTPYGTAP SYTPSYSGAV NNDPTVGTVA
GTPQWAFDNA TIPTKSLSGQ DEVGTYNVKY VGGLTSTLGY GFADNGGNGE LTIAPKEIVF
GNGLTGGVNN KVYDGTLTGT ITPLVLYVVA GDNVSLNSTG ATATFSNKNV GVGKTVTVAG
LALTGDDAGN YSIGNQTTTA NIIQASLTVT APGNLTKVYD GTVTAIGVAT VTGLVSGDTV
AGTVAIAYAD KMAGSSKAVN PLSVMIVDGS DMNMTGNYNI AYVPTVNNTI TQASLTLTSP
DNVSKFYDGL MSAPGAPMVT GLVPNDVVVT PAPLSYNDPE VGNNKTVSPN PAGLVIHDAN
GGDMTPNYVI TTIPRNDGVI VEKTFTPYKE WNDIDPSTPE VPTAAPEVSG NRDLGDVELA
ADDGGTTATR SLAMVAMDET AIQSDIVVTL LEPAAKNKQG VVKVFVPKEV LAKPAFLFPL
PDDVATAINQ TAVQERVFLQ NGDALPGWLS YDRDKKIFTA KSAPAGSLPL TVMVQAGSMA
WQVIIQQ