Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_1120 |
Symbol | |
ID | 3745086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 1272668 |
End bp | 1274734 |
Gene Length | 2067 bp |
Protein Length | 688 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637769155 |
Product | hypothetical protein |
Protein accession | YP_375025 |
Protein GI | 78186982 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000538221 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.614169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAATGGC TGCTGCGCAT CATACCCCTG CTCCTGCTGC TGCCGCTGCT CTTTTTGGCG GCAGCATGGG TCTTCTTTCC GCAGATTGCC CCCGCCCTTC TCCGTTCCGC TCTTGAAGCA CCGGGACGCC GTATAGAGAT AGCCGGCCTC TCCCGGCCCG GCTTCAACTC TGTAGGCTGC CATGCCCTCA CGGCCGATGT TGAGGTACCG CCGGGTCCCT GCACCCCTGA CACCGCCGCC ACCCTCTACC ATGCAACCAT CATCAGGCCG CAGTTGCGAT GGGATACCGA TCTTTCGAAG CTCCTCGCCA CGGGAAAAGC CACCATCACC CTTCGCCTCA CTGCTGACAG CCTGCAGTTC GGGCCAAGCT CCGGAGCATT CAGTTATTCC GATACCGACC CTCTTGCCGG CGTGGAGATG AGCATCTCGC TGAATGGTTT CAATGCACCG GCATTCACCC CGCTTGCCGC CTCGTATGTG ATCGATGATG CCGCCCTCGA GGCCGGCGGA TTCAAGGCCA GGGGCATCAG CGTCCCGCTC AGGGCCTCGG CCGACCGGGA ATGGGTGCCG GAAAAGGCCG ACATCAGCAT CCGGAGCCTC GAAAACAAAA CCGGTCCGTT GCCCGTTACC GCCATCCGGG CATCCCTCAT CGCTCTGCCC GACAGCAGCA ACCATTGCAC CCTCACCCTC AACCGCTCTT CCCTGGAACT CTTCGGCATG AAGGCCTCGA CCGACACGCT TGCCTACGAT ATGCTCCGGA AGCGGGCCGC ATTCCAGCTC GATATCGAAA ACGCCGATCT TATCCGCCTG CAGGGCACCA AAGCGGGCAA CCGGGAAAAG CCTTTTGCCA CCGGAGTGCT CCACGGAAGC ATTCCGGTCG TCTACGCCGA CTCTGTCCTG AGGGTAGAGG GAGCCTTCAT CTCGGCCTCC AAACCCCTTG CACTCCATTG GTACGACCGC ACCGTGCAGG AGTGGCTGTC GATAGAATTG GGGACGGGTC CCGTCCTCAG GAAGCTCAAG GCCCAAATAG CCGTCGGCGG CCCGGAGGGA ACCGAACTTA CATCCCTCTC GGCAGGAATG CTTCAGGGAA CCCTCAAGGC AAAAACCGCC AGAACTCCCG CCCCGTCCGG CAGGAAACCC ATCATCGTGG GGATTGAGGG CGTCGATGCC CTGCAGACGG TCAAATTCCA CGGAGCCTAT AAAGGAGCAC TCAAAGGCCG TATCTACGGT ACGGTACCTG TCACTCTTGA AAAAACGGGG TTTGCCATCC GGAACGGAAC CCTCCGCTCA GAGGGAGGAG GAACCATCTC AATCAGAGAC CCGAAAACCG GGATAGAAGC CTCCTATGCG TTCACTGAAC CAACGGCGCG CTTCACCCGT TACCCGACCG GTGCCCTTAC CCTCGATTTC AGCATGAGAG AGCTCACTCG CAAGGCAGAA GGCGGCGAAC TGCTCCTCAC TCACCCGGCA GGCAGAGCCT TGCTGTGGAG CGATCCGGCG TCCCCCGACA TGGTACGCCT TACGAACTTC AGTGCAGGAT TTTTCAACTC CACCCTCAGC ATAACCGATG CACGCTACGA TATGCTGACC GGCAGCGGGA ACACCGTGCT GCATTTCAGC AGCCTCCCGC TGCAGAAACT GCTGGACCTG CAGGGCACGA AAAAGCTCTA TGCAACCGGC ACCCTTCAAG GCGACATACC CGTCAGGATG GATAAAGAGA CCATCGCCAT CAAGGACGGT GGGCTGCGCG CCGAAGAATC CGGGCAGATC ATCTATGCCA CCACCCCCGA GGAACGGGCA GCCGCCAACC CAGGACTCAG GACAACCTAT GAGGCACTCA CCAACTTCCT CTACATCCAG CTCGCCTCAT CGCTTGACAT GGCTCCTGAC GGCGAGTCAC TCTTAACAGT CCAGCTGAAA GGCAATAACC CAGAATACCA GGGCGGGCGC CCGGTTGAAA TCAACCTCAC CATCCGCCAG AACCTCCTCT CCCTCCTGAA AAGCCTGAGC ATTGCCTCCG ACATCGAGCG CTCTATTTCT GAAAAAGCGC TCCGACCTGA GAAATAA
|
Protein sequence | MKWLLRIIPL LLLLPLLFLA AAWVFFPQIA PALLRSALEA PGRRIEIAGL SRPGFNSVGC HALTADVEVP PGPCTPDTAA TLYHATIIRP QLRWDTDLSK LLATGKATIT LRLTADSLQF GPSSGAFSYS DTDPLAGVEM SISLNGFNAP AFTPLAASYV IDDAALEAGG FKARGISVPL RASADREWVP EKADISIRSL ENKTGPLPVT AIRASLIALP DSSNHCTLTL NRSSLELFGM KASTDTLAYD MLRKRAAFQL DIENADLIRL QGTKAGNREK PFATGVLHGS IPVVYADSVL RVEGAFISAS KPLALHWYDR TVQEWLSIEL GTGPVLRKLK AQIAVGGPEG TELTSLSAGM LQGTLKAKTA RTPAPSGRKP IIVGIEGVDA LQTVKFHGAY KGALKGRIYG TVPVTLEKTG FAIRNGTLRS EGGGTISIRD PKTGIEASYA FTEPTARFTR YPTGALTLDF SMRELTRKAE GGELLLTHPA GRALLWSDPA SPDMVRLTNF SAGFFNSTLS ITDARYDMLT GSGNTVLHFS SLPLQKLLDL QGTKKLYATG TLQGDIPVRM DKETIAIKDG GLRAEESGQI IYATTPEERA AANPGLRTTY EALTNFLYIQ LASSLDMAPD GESLLTVQLK GNNPEYQGGR PVEINLTIRQ NLLSLLKSLS IASDIERSIS EKALRPEK
|
| |