Gene Plim_4161 details

Gene Information

Locus tag: Plim_4161
Symbol:
ID: 9140881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 5327168
End bp: 5328346
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003632170
Protein GI: 296124392
COG category:
COG ID:
TIGRFAM ID:
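
The coordinate and length fields above are consistent with one another. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 5327168
end_bp = 5328346

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1179 0 392
```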


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATACAA TGCACTTCAT CACTCGATTG ATTGTCGGCG GTGCGCAGGA GAATACCGTT 
TCGAATGTCG AGGATCAGGT GCAGCTTTTC GGCGACGAGG TCACACTTGT CACCGGGCCG
GGCCTGGGGC CGGAAGGTTC GCTCGAAGAG CGGGCGGCGA AATCGGGTGC CCGCTTGATT
GTCATGCCCG AACTGCATCG TGCCATTCGC CCCTGGCAAG ACTGGCGAGC GTACCAGGAA
CTCAGGCGGC TCATTCGCAC CTTGAAGCCC GACGTGATTC ATACCCATGC CTCGAAGGCG
GGGATCATCG GCAGGCAGGC AGGGTTTGTG GAAGGTGTCC CCGTTGTGCA TACCATTCAT
GGAGCCTCGT TTCATTACGG GCAGTCAGCT CCCGCCTATC GTCTCTATCG TTTTCTCGAA
CAACGGGCAG GCCGGCAGAC GGCTCACTTT ATCAGCGTTT CCGATGCGAT GACCGAGCAA
TACGTGGCTG CTCGAGTGGC CCCGCGAGAG AAATTCACGA CCATCCGCAG CGGCTTCGAT
GTCCAACCCT ATTTGTCGCC AGTCAAGAGT CGGGCCGAAA TCCGCGCTCA ACTCGGCCTG
AGCGAGAGCG ATCTTGTCGT CGGCAAGATC GCCCGGCTGT TCCACCTCAA AGGGCATCAG
TACCTCATTG CCGCCGCTCC CGAAATCGTC CGGCAACAAC CTCAGGTGAA GTTTCTTCTC
GTTGGTGATG GTATTCTCCG GGAGCAGTAT CAGGCGGAGA TCGCGAGGCT CGGTCTGACG
GATCACTTCG TCTTCACCGG CCTCGTGCCA CCCAGCCAGA TCCCGGAGCT CATTCATGCG
ATGGATGTCG TGGTCCATTG CAGCGAATGG GAAGGGTTAG CCCGCGTCTT GCCGCAGGGC
TTGCTGGCAG GCAAGCCCGT CATCAGCTAC GACATCGACG GTGCCAGCGA GATCGTCCGC
CCGGGTGAAA CTGGCTATCT GTTGCCGCGA GGTGATGTGC CAGGGTTGGC TAAAGCGACC
ATCGAGTTGC TGGCAAATCC GCTCCTTCGC CAGCAATACG GCCAGCGGGG GCGAGAATTA
TTTCAGGACG TCTTTCGTCA TGAGTATATG ACTCAGAAAA TTCGTGAGAT CTACGCGACA
GTCGCTCATC CTGCGAAAGC CATCCAGGCC CTGCAATGA
 
Protein sequence
MHTMHFITRL IVGGAQENTV SNVEDQVQLF GDEVTLVTGP GLGPEGSLEE RAAKSGARLI 
VMPELHRAIR PWQDWRAYQE LRRLIRTLKP DVIHTHASKA GIIGRQAGFV EGVPVVHTIH
GASFHYGQSA PAYRLYRFLE QRAGRQTAHF ISVSDAMTEQ YVAARVAPRE KFTTIRSGFD
VQPYLSPVKS RAEIRAQLGL SESDLVVGKI ARLFHLKGHQ YLIAAAPEIV RQQPQVKFLL
VGDGILREQY QAEIARLGLT DHFVFTGLVP PSQIPELIHA MDVVVHCSEW EGLARVLPQG
LLAGKPVISY DIDGASEIVR PGETGYLLPR GDVPGLAKAT IELLANPLLR QQYGQRGREL
FQDVFRHEYM TQKIREIYAT VAHPAKAIQA LQ
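
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates this on the first three 10-base blocks and on the last line of the gene sequence. It assumes the compact 64-letter encoding of the codon table in TCAG order; the amino acid assignments in table 11 are the same as in the standard code. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 39 bases of the gene sequence above.
head = translate("ATGCATACAA TGCACTTCAT CACTCGATTG")
tail = translate("GTCGCTCATC CTGCGAAAGC CATCCAGGCC CTGCAATGA")

print(head)  # MHTMHFITRL
print(tail)  # VAHPAKAIQALQ
```

Both fragments match the start and end of the protein sequence listed above. The tail fragment is in frame because the 19 preceding lines hold 60 bases each, a multiple of three.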