Gene Plim_3359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3359 
Symbol 
ID9140075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4346570 
End bp4348720 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content56% 
IMG OID 
Productflagellin domain protein 
Protein accessionYP_003631371 
Protein GI296123593 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAGAA TTAACACAAA CATTGATGCG CTCCGTGGGT TGCGCAACCT TCAGAAAGCC 
ACCAGTCAGC AGAACTCGGC GCTGACCCGA CTTTCGACCG GTGTGAAGGT CAACAGTGGC
CGGGATAACC CCGCCGGTCT GTTCGCTGGT GAGCGACTGA AACTGCAGAC GACCACCATT
GAGAAATCGA TTTCGAACAG CAATCGAGCC AACAATGTGC TCTCAACGGC TGATGCCGCT
CTGGGCGAAA TCAGCGGTCA GCTCAATAAG CTGCGCGGGC TGGTTCAGGA AGGTTTGAAC
GTTGGTGCCC TTTCCTCCAG CGAAACAACG GCTAACCAGG GCGAAATCGA CGAAATTCTT
TCTGCAATCA ACCGTATCTC GTCCAACACC ACGTTTGCTG GTGACAAACT GATTGACGGC
AGCAAAGGCT TCCAGACATC GGTTTCAGTC GGCGATTTGG ACAAATTCAG TGATTTCCAG
GTGAACCAGG CTGTCTTCGG CTCACGCACC AGCATTGCTT TGAATGCGAC AGTGAACACT
GCCGCCGAAA AGGCATCGCT GCGATACAGC GGCGGGGCTT TGACTTCAGC TGCCACAGTG
GAAATTGGCG GTTCTACCGG CAACCAGGCG ATTTCACTGG GTGGCAGCAG CACGGTCACC
AACATTGCTG ACGCGATCAA CGACGTTGCG GACACCACAG GTGTTGTGGC TACAGTAGTC
GGTGGTTTTG AGCTGACAGT TGATGCTGTT GCCAACACCG CAACAAAAGG CTCAGGCGCC
AACGGTGTGG TGACCTTCAC TGACGCTCGC AGGACTGCCG AGCTGGGAAC GAATGCTTCG
CTGGGTGGTA CTGTTTCGGT GACTTACGTC GCCGGTTCAG GTAACAACGT GGCGACATCG
GTGGCTGTTA CAGGAACAAC AAATCGCACT GTAACTGTGA CTCTGGGAAC AGACGCCAAC
GGTGCTGTCA GTGCCACTGC CGACGACGTG GTTGCCATTG TCGCTGGTGA TACGACTGCC
AGTTCGCTTG TTACGGGTGT GGCCAGTGGC GACGGTTCGG GTGTTCAGGC CGCTGAAGCC
GCTGATACGC TGACAGGTGG TACTGATCAG GGCATCCTGC GGCTGGGAGA TAACCGCACG
GTAGGAACCA ACGGCACATT GAGTGTTGTC GTCGATGCCA CTGGCAACAG CCAGACACTG
GGTGTCAGCG TGGGTGCAGC AGATAACGAT GGCAACCAGG TCATCACGAT TACTGCAGCC
ACAGATGCTA ACGGTGTCGT GACCTCGACT CTCGCAGACA TCGCGACTCT GATCAATGAT
GATGAAGATG CTTCGGAAGT GCTGCTGGCT TCGACACGTG GCGACTCAAC ATCACTGGGT
GATGATGTGG CGAGCACAGC TCTGAGCTTG ACCAATGGCG ATATCATTCT TGAAAGTTCA
GAGTATGGCT CTTCAGCCAA GGTCAACGTG ACCGCCCTGT CTGGCAGCTT CCAGACGACA
GGCAAGAACG ACACGACTTA TACAACTCGC GATTTCGGTG CTGATATCGG CGTGACAATC
AACGGCCAGG CAGCCCGTGG TGATGGTCTC AAGGCCAGCT TCCGCACTGC CAGCGTCGCT
GCTTCGCTCA CCTTTGCGAC TGATTCCAAC GAAGCTAACG CCACAGCGAC TTTGAACATC
GTCGGTGGTG GTGCACTCTT CCAGATTGGT GAACAGGCCA CTTCCGCCGG TCAGATTGGC
CTGGGGATTG AAGCGGTCAA CACTTCTCGC CTGGGTGGTA TCACCGGCAA ACTGTCCGAA
CTGGGTTCCG GTGGCGGCAA GAGCCTCGAA GACGTTCGCA AGAGCCTCGA TGGTTCGGGC
CCGAAGATCA CTTACGATGA TCTTGTGAGC ATCATCGATG AATCGCTGGA ACGAGTGACG
ACTCTCCGGT CTCAGATTGG TGCTGTGCAG TCGAACGTGA TCGATACCAA CATCTCGACT
TTGGGTGTGG CACTGGAAAA CATTTCTGCT GCCCGCAGCG AGATTGTCGA TACCGACTTT
GCTGCTGAAA CAGCCAACCT GCAGAAGGCT CAGGTTCTGG TACAGGCTGG TATCTCGGTG
CTGTCGATTG CCAATCAGGG GCCCAGCACC ATCGCCCAGC TCCTCCGGTA A
 
Protein sequence
MTRINTNIDA LRGLRNLQKA TSQQNSALTR LSTGVKVNSG RDNPAGLFAG ERLKLQTTTI 
EKSISNSNRA NNVLSTADAA LGEISGQLNK LRGLVQEGLN VGALSSSETT ANQGEIDEIL
SAINRISSNT TFAGDKLIDG SKGFQTSVSV GDLDKFSDFQ VNQAVFGSRT SIALNATVNT
AAEKASLRYS GGALTSAATV EIGGSTGNQA ISLGGSSTVT NIADAINDVA DTTGVVATVV
GGFELTVDAV ANTATKGSGA NGVVTFTDAR RTAELGTNAS LGGTVSVTYV AGSGNNVATS
VAVTGTTNRT VTVTLGTDAN GAVSATADDV VAIVAGDTTA SSLVTGVASG DGSGVQAAEA
ADTLTGGTDQ GILRLGDNRT VGTNGTLSVV VDATGNSQTL GVSVGAADND GNQVITITAA
TDANGVVTST LADIATLIND DEDASEVLLA STRGDSTSLG DDVASTALSL TNGDIILESS
EYGSSAKVNV TALSGSFQTT GKNDTTYTTR DFGADIGVTI NGQAARGDGL KASFRTASVA
ASLTFATDSN EANATATLNI VGGGALFQIG EQATSAGQIG LGIEAVNTSR LGGITGKLSE
LGSGGGKSLE DVRKSLDGSG PKITYDDLVS IIDESLERVT TLRSQIGAVQ SNVIDTNIST
LGVALENISA ARSEIVDTDF AAETANLQKA QVLVQAGISV LSIANQGPST IAQLLR