Gene Lcho_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_4103 
Symbol 
ID6160061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp4607402 
End bp4609459 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content72% 
IMG OID641666881 
Productpatatin 
Protein accessionYP_001793120 
Protein GI171060771 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000427908 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGCCG ACGACCGCGA GCCGCCGGAA GACCGCTTCT GCGACCTGGT GCTGACCGGC 
GGGGTGACCA GCGGCGTGAT CTACCCGGCG GCGATCCACG CGCTGTCGCG CAAGTACCGT
TTCCACGCCA TCGGCGGCAC CTCGGCCGGC GCGATGGCCG CGGCGCTGAC CGCGGCGGCC
GAATACGCGC GCCGCTTCGG CTCGTCAGCC GGCTTCGAGG TGCTCAAGGA GATCCCCGAG
CGGCTGGCCC GCCCGGTCGG CCAGGGCCGC GACGCGAAGA CCAAGCTGCT CGCGCTGTTC
CAGCCCTCGC CGCGGTGCGC GCGGCTGTTC GAGGTGCTGC TGGACGTGAT CGGCACCGAC
CTGCAGCGGA CCTTCGACGG CCTGTTCAAG CGCAGCGTGA AGGCCATCGC CCGGGTCTAC
GGGAATGGGC CGGCGGCCAT CGTCACGGGC GTCTTCCTTT CGAGCCTGGT GGCGATCGCG
TCGGGCCTGG ACACGACCGT CGAGCGCGCC AGCGTGGCGG TGCTCGCGTT CGTCATCGGC
ATCGGCGTGG TCATCGCGGT CATAACGGCC CGACTCCTGC GCGACGTGCG CGACGGCCTG
ATCGCCAACG ACTACGGCCT GTGCACCGGC CTGTCGGCAT CGGACACGCC GGACGAGGAC
TCGGAGGCGG TGATCGACTG GCTGCACAAG GGCATCCAGT GGGCCTGCGG TCGCAGCACC
GACGACACGC CGCTGACGTT CAAGGACCTG TGGGAAGCCC CGGCCACCGG CCTGCCGCCG
CCCAAGCCGC GCCAGGCCGG ACGCCGCGCG ATCGACCTGC GCATGGTCAC GACCAGCCTC
ACGCACGGCC GCCCCTACGA GCTGCCGCTG GCCGACACGG CCGAGAACGA ATGCCTGTTC
TTCTGCCTCG ACGACTGGCG GCGCTACTTT CCGCCGCGCG TGATCGAGCA GCTGCGCGCC
ACCTGCACGC CTTACGGCGC GCTGGCCGGG CGCTTCGACC ACGCGCCCGA ACCCGAGGGC
AAGGTGCTGC TGCAGATGCC GCGCGCCCGC ATGCCGCTGG TGGTGGCGGC ACGGCTGAGC
CTGAGCTTTC CGGTGCTGTT CTGCGCGGTG CCGGCGTGGG GGTTCGATCC GGCGCTCAGG
CAGTTCCGGC CTTGCTGGTT CTCGGACGGC GGCATCTGCG CCAACTTCCC GATCCACCTG
TTCGACGCGT CGATCCCGAG CTGGCCGACC TTCGGCGTCT TCCTGACCGA GAACCGCAAC
GGCGATGACG AAAGCAATGT CGAGGTCACG CACACCCACG ACGCGGGCGC CCACGATGTC
TGGAGCGATT TCGGCGATCC GAACGAGCCG CCGTCGCTGG CCCGGCTGGG CCGCTTCGCC
TGGAGCATCG TCAACACCGC GCGCCGCTGG CACGACCACA CGCTGGCGCG CATGCCAGGC
GTGCGCGACC GCGTGGTGCG GGTGGCGCTG GGTCGCGACC AGGGCGAGCT GAACCTGAAG
ATGAAGCCGG ACGACATCCT CCGGATGGTC GACGAATGCG GCCTGAAAGC TGCGCAGGAA
CTGATCGCCA AGTTCGCCCC GGACGACACG CAGGGCTCGG TCGCCACGGC CTGGTCGGAG
CACCGCTGGG TGCGCTTCAA CGCCATCGTC GACGGACTGC GCGAGCGGCT CGACGGCCTG
GCCGATGCGG CCGAGGCGGC GCCCTACTCG CTCAAGATCT CGCAGCAGAT CGAGCGTGCC
GGCCAGCACA GCCCGCTGCG CCGCGGCCAC GCCGATCGCC TGCCGCGCGG CCCCTGGCAG
AGCGCGCCGC CACCGATGTT CGAGGCCGAA GACGAGACCG GCAGCGGATC GGGCGACGCC
GCCGCCCGCA GGGCCGCGCC GCTGGGCCAG CACCGCCGCG ACACCGGCGA GGACCGGCTC
GACGACGACC AGCGCCAGGC GCTGGCGGCG CTGCTGGATG CGCTGGTCGA GTTCGAGCAG
AAGCTGCAGC AGAGCTGGCA CGAGCAGCCC TACGTGCCGA ACCCGCGGCC GCGGCTGGCG
GTGCGACCGC CGATGTGA
 
Protein sequence
MSADDREPPE DRFCDLVLTG GVTSGVIYPA AIHALSRKYR FHAIGGTSAG AMAAALTAAA 
EYARRFGSSA GFEVLKEIPE RLARPVGQGR DAKTKLLALF QPSPRCARLF EVLLDVIGTD
LQRTFDGLFK RSVKAIARVY GNGPAAIVTG VFLSSLVAIA SGLDTTVERA SVAVLAFVIG
IGVVIAVITA RLLRDVRDGL IANDYGLCTG LSASDTPDED SEAVIDWLHK GIQWACGRST
DDTPLTFKDL WEAPATGLPP PKPRQAGRRA IDLRMVTTSL THGRPYELPL ADTAENECLF
FCLDDWRRYF PPRVIEQLRA TCTPYGALAG RFDHAPEPEG KVLLQMPRAR MPLVVAARLS
LSFPVLFCAV PAWGFDPALR QFRPCWFSDG GICANFPIHL FDASIPSWPT FGVFLTENRN
GDDESNVEVT HTHDAGAHDV WSDFGDPNEP PSLARLGRFA WSIVNTARRW HDHTLARMPG
VRDRVVRVAL GRDQGELNLK MKPDDILRMV DECGLKAAQE LIAKFAPDDT QGSVATAWSE
HRWVRFNAIV DGLRERLDGL ADAAEAAPYS LKISQQIERA GQHSPLRRGH ADRLPRGPWQ
SAPPPMFEAE DETGSGSGDA AARRAAPLGQ HRRDTGEDRL DDDQRQALAA LLDALVEFEQ
KLQQSWHEQP YVPNPRPRLA VRPPM