Gene Lcho_4093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_4093 
Symbol 
ID6159941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp4588782 
End bp4591622 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content74% 
IMG OID641666871 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001793110 
Protein GI171060761 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000114159 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGCTGGC CTGTCATCCC GAATTCCCGC ACCGGCCCCC CACCGGCACG CGTCGCCGCG 
CTCGCGCTGG CCGTGCTGGC GCTGCTCGGC GCACCCGCCG TCCGCGCCCA GACCAACCTG
CCGGTGCCGG CGACCACCGC GCCGCAGAAC GCCACCATCA GCACGCCCGC CGTCGGTCAG
ATGGTGATCG TGCAGAACGA GAGCGCGCCA CGCGCCTACA TCGAGTGGCG CGATTTCTCG
ATCGGCGCGC AGGCCGGCGT GACGGTGCAG CAGCTCAACG GCAGCTCGGT GCTGCTCAAC
CGCGTGGTGG GCGTGGGCGG CGCGCCGCCG ATCAGCCGCA TCGACGGCAC GCTCGGCGCC
ACCGGCCAGG TCTTCATCCT CAACCCGGCC GGCATCGTGT TCGGCGCCGG CGCGCAGGTG
CAGGTCGGCA GCCTGCTGGC GGCGGCGCTG GACACCTCGA CCGGCGGCGC CGACTTCCTG
TCCGGCGCGC CGCTGGTGCT CGATGCCTCG CTCGGCGGCG GCGCGCTGAG CGTCGCGCCG
GGCGCCCGCA TCGCTGCGGC GGCGGGCAAC CCCAACCTGC CCGGGGGCGA GATCGTGCTG
ATCGGCGCCG GTGGCGCAGC CGGGGCGGGC GGCCTGACGA TGGACGGCCG CGTCGTCGCC
TCGGGCGGCC AGGTGCTGCT GGCGGTGAGC GACGGCACCA GCGTGCCGGC CGCCGGCATC
CCGGTCGGCG CCAGCGGCTT CATCACGCTG CAGCTGACCC CGGCCGCGGC CGGCACGCTG
GCGATGCACG GCAGCGTCGA CGTGGCCGGC GCCGTCAGCA CGCCCAGCGG CAACCTGCGC
ATCGAGGCGC CGACGGTGCA GATCGGCCGC CGCGCGGGCC TGGCCGCCAC CTCGGTGTCC
TTGAGTGGCG CCAGCGTCAC CGCGCTCGGC AGCGCCGGCC CGGGCAGCCT CTCGATCAGC
GGCAGCGCCG ACGGCGGCTT CGGCATCGAC CTGAACAACG TCACGCTCAG CGGCGACCGC
ATCGAACTGC GTGGCACCTC GCTGCTGACC GATGGCAATG CCGACCTGCT GCCGATCGGC
GTGCGCCTGA ACGGGGTGTC GATCGACATC GGCGACGGCA GCCTGCTGAT CGCCGGGCGC
GGCGAGGTGG CGTCGACCTT GTCGCCCGAC ACGTCCCCGG CGTTCGGCGT GGGCCTGTCG
GATCTGTTCG TCACCAGCAA CGCCAACGCC GGGCGCTCGG TCACGATCGT CGGCGAGGCG
GTCAACTCAT CGACCGGCGC GGGCATCGGC GCGGTCAACG ACGGCTTCTT CATCATCGCC
AGCAACAGCG AGGTCGACCC GTCGGCCGCG AACGTGGTGC TGGCCGGGTA TGCCGGCGCG
CAGGGTCGCG CCTACGACCT GTTCGGCGCG CCCGACGTGT TCACCACCGG GCGCGTCAAC
CTGCGCCCCG CCGGTGTCGA CGCCGACGGC ATCGTGCAGG AACGCCCGGC GGTGCCGATC
ACGCTCGGCG GTGCCGTCTC GAGCGTGCCG CTCGGCTTCA ACCTGCCGTC GGCCTGGCTG
CTCGATCCGC GCATCAACCC GGGTGGCAAC ATCGAGAGCG CCGGCATCGT GGTCGGCTCG
AGCGGCCATG TCGGCGCGAT CGCGGTGGCG GCCGACGCCC TGCAGGCCGG CATCACGCCC
GAGCTGACGC TGCACAACGG CGGCGCCGGC GCGCAGGGCA TCCAGCTCGA CGGCGGCCTG
ACGACGACCG GCGCGGTGCG CCAGCTCACG CTGGTCAGCG CCGGTGCGGT GACGCAGACC
GGCCCGGTCG CGGTGCAGCA GCTGCTGCTC GCCGGCACGG GGGCGGATGC CACGGTGCAG
CTGCTCGACC CCGGCAACAC GATCGACCAG ATCGGCTTCA CCGGCCTGCG CAGCGTGCAG
GTGGCGAGCG CCGGGCCGCT GTCGGTGGCG GGTGGCAGCG TGGCGGCCTA CGACAGCGCG
AGCGGCAGCT TCACCCCGCA GGCCTTCACC ACCAGCCTGG CCAGCGATCG CGTGCTGCTG
CGTGCCGACG ACGGCGACCT GACGCTGCAG CAGGGCATTC GCGCCAGCGC GGCCGGTGGC
CAGATCGACC TGGTGGCCGG CGTGCGGTTC CAGAACCCGG CCAACGCCAC GCTGGAGGTC
GGCGCGGGCG GGCGCTGGCG GGTCTGGTCC GACAGCTGGG TCGACAGCCA GCGCGGCGAG
CTGCCCGGTC GCGCCAGCCT GCCCACGCTC TACGGTTGCA GCTACGGCGA CGCCAGCCTG
TGTTCGGTGT CGCAGATCGC CTTGCCCGAG GCCGGCAGCG GCTTTCTCTA CAGCGCGCAG
CCGGACCTCG TGATCGTGCC CGACCCGGTC AGCGCGGCCC AGGGCACCTT CCTGCCGCCG
ATCCCCTACA GCGCGACCGG CCTGGTCAAC GGCGACGTGC AGCCGCAGGC CGTCACCGGC
CAGCTGGCCA GCAGCGCCGG CCTGCTCAGC CCGCCGGGGC TCTATCCGGC GACGGCGGGC
ACGCTGCAGT CGCCCACCGG CTACCGGCTC AGCCTGGCGT CGGGCACGGT GCTGCCGGTG
CGCATCACGC CGCCCGACCC GCTGCGCAGC GCCTATCCCG ACGGCATGGC GCAGCTGCTG
GCGGCGTCGG CCTCCGAGAC CCACGGCCGC AACCTCGCCA CGCCACGCAT GTGCCTGGCC
AGCGGCCCGG CCCGCACGGC GGCGATCGAC GACGCCACGG CCGATCTGCT CGGCCTGGAA
TGGGGCCGCG TGCGCCAGCA GCCGCAGTTC TCGAGCTGCC TTGAGCTCGA TCGTGGTGGC
GGCGCCTGTG CCGGTTTCTA G
 
Protein sequence
MCWPVIPNSR TGPPPARVAA LALAVLALLG APAVRAQTNL PVPATTAPQN ATISTPAVGQ 
MVIVQNESAP RAYIEWRDFS IGAQAGVTVQ QLNGSSVLLN RVVGVGGAPP ISRIDGTLGA
TGQVFILNPA GIVFGAGAQV QVGSLLAAAL DTSTGGADFL SGAPLVLDAS LGGGALSVAP
GARIAAAAGN PNLPGGEIVL IGAGGAAGAG GLTMDGRVVA SGGQVLLAVS DGTSVPAAGI
PVGASGFITL QLTPAAAGTL AMHGSVDVAG AVSTPSGNLR IEAPTVQIGR RAGLAATSVS
LSGASVTALG SAGPGSLSIS GSADGGFGID LNNVTLSGDR IELRGTSLLT DGNADLLPIG
VRLNGVSIDI GDGSLLIAGR GEVASTLSPD TSPAFGVGLS DLFVTSNANA GRSVTIVGEA
VNSSTGAGIG AVNDGFFIIA SNSEVDPSAA NVVLAGYAGA QGRAYDLFGA PDVFTTGRVN
LRPAGVDADG IVQERPAVPI TLGGAVSSVP LGFNLPSAWL LDPRINPGGN IESAGIVVGS
SGHVGAIAVA ADALQAGITP ELTLHNGGAG AQGIQLDGGL TTTGAVRQLT LVSAGAVTQT
GPVAVQQLLL AGTGADATVQ LLDPGNTIDQ IGFTGLRSVQ VASAGPLSVA GGSVAAYDSA
SGSFTPQAFT TSLASDRVLL RADDGDLTLQ QGIRASAAGG QIDLVAGVRF QNPANATLEV
GAGGRWRVWS DSWVDSQRGE LPGRASLPTL YGCSYGDASL CSVSQIALPE AGSGFLYSAQ
PDLVIVPDPV SAAQGTFLPP IPYSATGLVN GDVQPQAVTG QLASSAGLLS PPGLYPATAG
TLQSPTGYRL SLASGTVLPV RITPPDPLRS AYPDGMAQLL AASASETHGR NLATPRMCLA
SGPARTAAID DATADLLGLE WGRVRQQPQF SSCLELDRGG GACAGF