Gene Lcho_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1124 
Symbol 
ID6163738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1200420 
End bp1203473 
Gene Length3054 bp 
Protein Length1017 aa 
Translation table11 
GC content73% 
IMG OID641663878 
Productgeneral secretion pathway protein E 
Protein accessionYP_001790158 
Protein GI171057809 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.553401 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAA TTTCCCTGCC GTCCCTGTCG CTGGTCCCCA CGCCCGACGA GCTGCGTCGC 
GGCACCGGCG CGCTGCCCGT GCAGGCGCCC GGCAAGGCGT CGGCGTCGAA GAAATCCTTT
GCCTGGCCGA CTCCGCCGCT GGCCGCCTAC CCGCTGCCGG TGCCCGCGCT CGAGCCCGAG
CCGTGCGAGA TCGAGGGCCG CACCGGCAAC CTGATGGCCG GGCTGATGGT GTCGTTCGAG
CCCGAAGAAG GCCTGGTGCG CGTGCAGGTG CCGCCCGAGC GGGTGCCGAT GGCGCTGCGC
TTCAACCAGA TCCGCCGCCT CACGCTCAAG CGCAGCCTGG CACCGCTGAG CGCGGCGATG
GTGGCCGACG CCGCCGGCGA TCCGCCCGAC CAGGTGCTGG CGCATCACCA GAGCCAGTCG
TACACGGTGC AGCTGATGGG CGGCGCCAGC ATGGCCGGGC GCACGGTCGG CCATGTCGAG
ATGCCGGTCG GCCTGTTCCT GTTCGCGCCG CTCGACGGGC TGGGCTCGGT GCAGCGCATC
TTCGTGCCGC GGGTGGCGAT CGAGCAGTTC CAGATCGGCG AGCGGCTCGG CCAGATGCTG
ATCGAGCAGC AGGCCACCAC GCCCGAACAG CTCGAACAGG TGCTGCTGCA GCAGCAGACC
CAGCGCCAGA AGAAGCTCGG CGAACAGCTG GTCGAGCGCC AGATCGTGAC CCCCGAACAG
CTGCTGACCG CGCTCGACAA GCAGGCCCGC ATGCCGTCGG TGCGCCTGGG CGAGGCGCTG
GTGGCGCTGG GCTATCTGAC CGACAAGCAG CTGCAGGAAG CCCTGCAGCT GCAGCGCACC
GACCGGGTCC AGCCGCTGGG CGAGCTGCTG GTCGAAAAAG GCCTGGTCGA AGGCGAGCAG
CTGCGCATCG CGCTGGCCCG CAAGATGGGT TATCCGGTGG TCGATGTCGC CGGTTTCCCG
GTCGACCCGG CCTTGATCCC GCTGCTGCCG GCACCGGCCG CCCGGCGCCT GCAGGTGTTG
CCGCTGATGC GCCGCGGCGG CCGGCTGGTG GTAGCGATGC ACGACGCCAG CCAGCAGTCG
GTGATCGAGG AACTGCAGCG CCTAACGCAA TCCCACATCG CCCCGACGCT GGCCGGCGGC
ACCGGCCTGG CGGAGGCGAT CGAGCGCGCC TACAGCCAGG TGACGTCCGA CCTGATCGAA
TACACGCCGG TCGACGACCT GCCGGCACGG CGCACGCCGA TGCCGCCCGC GCATTTCCCG
GCCGCCGCGC TGCGCCGGGC GGCGGGCCCG TCGGTGGATC TGCCGGTGGA GCTGCCCGTG
ACCGTCCCGA CCCTGCCGCC ACCGGCGCCT CCAGTCGTCG TGCAGGACAC GCTGCGGCCC
AACGAAGGCC CGATGTCGGT CGCCGACGCG GCCGACGCGG GTTTCCCGCT GATCTCGCTG
GAGGCGATGG GTGTCGAGGT GACGGAGCTG ACCGGCCTGG CCGGCCACGA CGCGGCCGAC
CTGCCGCACC CGGTCGACCA CACGCCGACC CAGCCGCTGG CCCATCCGTC GGACCCTCAC
GCGGCAGCCA CCCACGAGCA CGCCGAAGTC GACCTGCCGA TCACCGTGCC GGCCGGCTGG
CCGGTCGACG CCCCGACGGT CGATGTCACG CCGGCGCAGG CTCCGTCGAT CGAGGCTCGC
CACGCCGAAC CCGCGCCCGT CGCTGCGCCG TCGCCCGGCA AGCCGGTGCC GCGCCACGAG
CGCCAGGCCC AGGCCGCCGC CGCCGATGCC GAGGCGTCCA GCCGGGCCCG CAGCAGCGCC
GATCGCCACG AGAGCCCGCT GCTGCAGACG CTGGCCAACC TGGTGCTCGA TGCGCTGGGC
CGCGGCGCCA GCAGCGTTCA GATCGAAACC CTCGGGCCCG ACGACAAGCT GCAGGTGCGC
CTGCGCCGCA ACGGCCGGCT CGAGCCGCAC ACCGATCTGC CGGCGACCTA CCGGGTGCCG
CTGATCGCGC GCATCAAGGC CCTGTGCGAG CTCGACGTCA GCGAAACCCG CCGTCCGCAG
GAGGGCCGGC TGGCGTTCGG CCGACTGGTG CCGCAGCACA AGATCGACCT GCGGGTGCAC
GTGCTGCCGA CCCAGAACGG CCTCGAGGAC GTCGTGATCG GCCTGCCGTC GCGCCTCAAG
CCGATGGCGC TGGATGCGCT GGGCATGGCA GCGCCCGAGG TCGAGCGGCT GAAGGGCCTG
CTCGACCGGC CGGCCGGCCT GATCCTGTGT GTCGGCCCGG CGCGATCGGG GCGCACCACC
AGCCTGCACG CCAGCCTGGC ACACCTGAAC CGCCCCGAGC GGCGCATCTG GACGCTCGAG
GACCGCATCG AGCTGACCCA GCCGGGCCTG CGCCAGATGC AGGTCCACCC CGACGAGGGG
CAGACCTACG AATCCGGCCT GCGCACCCTG CTCAACACCG ACGCCGACGT GCTGATGGTC
GGCCACATCG GCGACGTCGG CACCGCCCGC GTGGCGGTCG ACGCGGCGCT GCAGGGCCGG
CTGGTGATCG GCGCGATGAC CGGTCGCAAC GCCTGCGACG CCGTGATGCG GCTGATGGAC
CAGGGCGTCA CGCCGTGGGA TCTGTCGGAC GCGCTGCTGG GCGTGCACAG CCAGCGCCTG
CTGCGCCGCA TGTGCAGCGC CTGCCGCATG AGCCGCAGCG CCAAGGAGAC CGAGATCGAG
GAATGGGTGG AAGGCTATTT CCACGGCGCC GTGGTGGCCG ATCCGCTGCC CGAACGCGAG
GCGCTGCTGC GCAGCTGGCT CGAGCGTTTC GGCCGCGAAG GCCGGCTGCG CCGCTTCCAG
AGCCCCGGTT GCGAGCGCTG CGGCCACACC GGCCAGCGCG GCCGGCTGGC GGTGCACGAG
CTGCTGGTGG TGACGCGCGA GCTGCGCCGC CTGATCCGCG CCGGTGCGCC GGCCTGGAAC
CTGCAGCGCC AGGCCCAGAA GGACGGCATG CGCACGCTGC GCCAGGAGGC GGTCGAGAAG
ATGGTGGCCG GCCAGATCAC GCTCGACGAG GTGCGCACGG TGCTCGACCT GTGA
 
Protein sequence
MSQISLPSLS LVPTPDELRR GTGALPVQAP GKASASKKSF AWPTPPLAAY PLPVPALEPE 
PCEIEGRTGN LMAGLMVSFE PEEGLVRVQV PPERVPMALR FNQIRRLTLK RSLAPLSAAM
VADAAGDPPD QVLAHHQSQS YTVQLMGGAS MAGRTVGHVE MPVGLFLFAP LDGLGSVQRI
FVPRVAIEQF QIGERLGQML IEQQATTPEQ LEQVLLQQQT QRQKKLGEQL VERQIVTPEQ
LLTALDKQAR MPSVRLGEAL VALGYLTDKQ LQEALQLQRT DRVQPLGELL VEKGLVEGEQ
LRIALARKMG YPVVDVAGFP VDPALIPLLP APAARRLQVL PLMRRGGRLV VAMHDASQQS
VIEELQRLTQ SHIAPTLAGG TGLAEAIERA YSQVTSDLIE YTPVDDLPAR RTPMPPAHFP
AAALRRAAGP SVDLPVELPV TVPTLPPPAP PVVVQDTLRP NEGPMSVADA ADAGFPLISL
EAMGVEVTEL TGLAGHDAAD LPHPVDHTPT QPLAHPSDPH AAATHEHAEV DLPITVPAGW
PVDAPTVDVT PAQAPSIEAR HAEPAPVAAP SPGKPVPRHE RQAQAAAADA EASSRARSSA
DRHESPLLQT LANLVLDALG RGASSVQIET LGPDDKLQVR LRRNGRLEPH TDLPATYRVP
LIARIKALCE LDVSETRRPQ EGRLAFGRLV PQHKIDLRVH VLPTQNGLED VVIGLPSRLK
PMALDALGMA APEVERLKGL LDRPAGLILC VGPARSGRTT SLHASLAHLN RPERRIWTLE
DRIELTQPGL RQMQVHPDEG QTYESGLRTL LNTDADVLMV GHIGDVGTAR VAVDAALQGR
LVIGAMTGRN ACDAVMRLMD QGVTPWDLSD ALLGVHSQRL LRRMCSACRM SRSAKETEIE
EWVEGYFHGA VVADPLPERE ALLRSWLERF GREGRLRRFQ SPGCERCGHT GQRGRLAVHE
LLVVTRELRR LIRAGAPAWN LQRQAQKDGM RTLRQEAVEK MVAGQITLDE VRTVLDL